Using systematic reviews and meta-analyses to support regulatory decision making for neurotoxicants: lessons learned from a case study of PCBs.
BACKGROUND: Epidemiologic weight-of-evidence reviews to support regulatory decision making regarding the association between environmental chemical exposures and neurodevelopmental outcomes in children are often complicated by lack of consistency across studies.OBJECTIVE: We examined prospective cohort studies evaluating the relation between prenatal and neonatal exposure to polychlorinated biphenyls (PCBs) and neurodevelopment in children to assess the feasibility of conducting a meta-analysis to support decision making.
DATA EXTRACTION/SYNTHESIS: We described studies in terms of exposure and end point categorization, statistical analysis, and reporting of results. We used this evaluation to assess the feasibility of grouping studies into reasonably uniform categories.
RESULTS: The current literature includes 11 cohorts of children for whom effects from prenatal or neonatal PCB exposures were assessed. The most consistently used tests included Brazelton's Neonatal Behavioral Assessment Scale, the neurologic optimality score in the neonatal period, the Bayley Scales of Infant Development at 5-8 months of age, and the McCarthy Scales of Children's Abilities in 5-year-olds. Despite administering the same tests at similar ages, the studies were too dissimilar to allow a meaningful quantitative examination of outcomes across cohorts.
CONCLUSIONS: These analyses indicate that our ability to conduct weight-of-evidence assessments of the epidemiologic literature on neurotoxicants may be limited, even in the presence of multiple studies, if the available study methods, data analysis, and reporting lack comparability. Our findings add support to previous calls for establishing consensus standards for the conduct, analysis, and reporting of epidemiologic studies in general, and for those evaluating the effects of potential neurotoxic exposures in particular.
KEY WORDS: domain, function testing, meta-analysis, neurodevelopment, neurotoxicants, PCBs, risk assessment, weight of evidence. Environ. Health Perspect 118:727-734 (2010). doi:10.1289/ehp.0901835 [Online 22 February 2010]
**********
Extensive literature exists on the use and interpretation of neurodevelopmental tests that serve as outcome measures in population studies examining effects of environmental exposures. However, conclusions about the presence or absence of a causal relation between an exposure to a specific toxicant and a particular outcome are generally based on weight of evidence (WOE), because even well-designed studies are subject to methodologic limitations that are unavoidable in observational research; no single study can be considered sufficient for producing definitive results. For this reason, it is crucial that the scientific and regulatory communities are able to evaluate findings across studies before rendering the WOE-based conclusions. The term "WOE" has several possible definitions; we refer to WOE as a methodology with a "simple premise: that all available evidence should be examined and interpreted" (Weed 2005). It is important to clarity that for the purposes of this review we focus on weight of epidemiologic (vs., e.g., toxicologic) evidence. Such WOE evaluation is possible only if the studies under review use the same or similar methods of exposure assessment, outcome ascertainment, data analysis, and reporting of results.
To provide a methodologic framework for a review of the association between in utero and early-life exposures to environmental chemicals and neurodevelopmental outcomes in children, we considered epidemiologic studies that focused on prenatal and neonatal exposure to polychlorinated biphenyls (PCBs). Our selection of PCBs as the exemplar chemical class was based on two main considerations. First, it was important to select an environmental chemical or chemical class for which a sufficient body of peer-reviewed literature was available for evaluation. Scientific studies on PCBs and neurodevelopment date back to the early 1980s and include cohorts from several countries. Second, for the purposes of this examination of neurodevelopmental epidemiologic studies and implications for interstudy comparison, we sought to select a chemical or chemical class for which substantial uncertainty exists regarding presence or absence of a causal relation between prenatal/neonatal exposure and neurodevelopmental outcomes. With respect to PCBs, recent reviews appear to indicate considerable disagreement among experts (Boucher et al. 2009; Cicchetti et al. 2004; Kimbrough and Krouskas 2003; Schantz et al. 2003), and controversy exists as to whether PCBs at current environmental levels of exposure are in fact neurotoxicants (Winneke et al. 2003). Although others have provided reviews of the PCB neurodevelopment literature (Boucher et al. 2009; Kimbrough and Krouskas 2003), their value for WOE is weakened by differing and sometimes idiosyncratic matching of neurodevelopmental assessment instruments to putative neurodevelopmental domains and by a lack of formal assessment of consistency across studies addressing the same exposure--outcome associations for the same or similar study populations.
The purpose of this review is not to weigh in on the ongoing debate over neurodevelopmental effects of PCBs. Instead, we used studies of PCBs as a vehicle for evaluating the state of the science in population research aimed at investigating the relation between prenatal or neonatal exposures to environmental chemicals and performance on neurodevelopmental function tests. In this review, we present results from our assessment of the epidemiologic literature on the relation between PCBs and neurodevelopment regarding a) the consistency of study methods with respect to exposure assessment, outcome ascertainment, and data analysis; and b) the feasibility of conducting a quantitative WOE assessment of existing epidemiologic data (i.e., a meta-analysis). The goals are to develop a general framework for assessing the body of evidence in neurodevelopmental environmental epidemiology studies and to offer recommendations to guide future research such that results will be more amenable to WOE reviews in support of regulatory decision making.
Methods
Identification/selection of studies. We used several electronic data sources [PubMed (http://www.ncbi.nlm.nih.gov/pubmed), Cochrane Library (http://www.thecochranelibrary.com), EMBASE (http://www.embase.com/home), PsycINFO (http://www.apa.org/pubs/databases/psycinfo/index.aspx), and Web-of-Knowledge (http://apps.isiknowledge.com)] to conduct the initial literature search, with an end date of December 2009. Using keywords "polychlorinated," "biphenyls," "PCB," "PCBs," "children," "prenatal," "neurodevelopmental," and "neurobehavioral," as well as various combinations of these keywords, we selected relevant articles that investigated the neurodevelopmental effects of environmental PCB exposures in children (poisoning events were not considered). We reviewed secondary references of retrieved articles to identify publications not captured by the electronic search. We conducted additional literature searches to identify relevant reports and textbook chapters that were not published in the peer-reviewed literature.
The prospective longitudinal design provides the most informative data for examining outcomes associated with in utero and early-life exposures (Amler et al. 2006; Wigle et al. 2008). For this reason, our search of the literature focused specifically on cohort studies that recruited participants either prenatally or soon after birth and linked various measures of pre-and postnatal PCB exposures to neurodevelopmental outcomes at different ages; at the time of this review, some of the studies had conducted only one neurodevelopmental evaluation.
Literature review. We retrieved and reviewed the publications identified via the literature search (~ 60 articles) and extracted information on each relevant study with respect to its methods of data collection, analysis and reporting. Extracted information was categorized according to the following characteristics: a) cohort description--year of enrollment, geographic location, and ages at which neurodevelopmental/neurobehavioral tests were administered; b) exposure categorization--whether information was based on maternal dietary questionnaires or measured (e.g., in breast milk, maternal serum, or cord blood) and units of measures [e.g., nanograms per gram, parts per billion, or toxic equivalents (TEQs)]; c) tests used to define the end points of interest--neurologic [e.g., neurologic optimality scares (NOSs)], cognitive [e.g., Bayley Scales of Infant Development (BSID)], or other tests assessing specific domains of functioning; and d) analysis and reporting of result--linear regression coefficients with and without log transformation of variables, parametric or nonparametric comparisons of outcomes in two or more groups, or qualitative description of results.
This characterization of the cohort studies allowed us to search for reasonably homogeneous groups of articles that could then be included in a systematic analysis. Within each group, we assessed the feasibility of a meta-analysis of the published data. It is a common practice that a minimum data set needed for the systematic analysis should include at least three similar studies, in which measures of effect and corresponding measures of variance for the same exposure--outcome association within the same age group either were reported by the study authors or could be calculated using the data from the original articles (Treadwell et al. 2006).
Results
Overview of PCB cohort studies. The current published literature includes 11 cohort studies of children for whom pre-or neonatal PCB exposures were measured (as maternal blood levels during pregnancy, cord blood, breast milk concentrations, or combinations of these) or estimated. These studies represent a wide range of populations (in terms of geography and year of enrollment) recruited either at birth or prenatally, some of which were followed for several years (up to 11 years for one cohort), Geographically, five of the cohorts were recruited in the United States and Canada, five in Europe, and one in Japan. The neurodevelopmental outcomes at various ages were described in at least 40 different articles with publication dates spanning a 26-year interval from 1984 through 2009. Figure 1 summarizes tests administered in each cohort study at different ages through the seventh year of life. [Figure 1 also includes a twelfth cohort--the Pregnancy, Infection, and Nutrition Babies Study (Pan et al. 2009)--that has reported results for only one neurodevelopmental function testing period to date and used function tests that differed from those used for the other cohort studies. This cohort is not discussed further.] Most (9 of 11) cohorts were evaluated for neurologic and behavioral function or cognitive ability during the first year of life. After the first year of age, the frequency of testing decreased. Importantly, although not shown in Figure 1, after 8 years of age the available neurodevelopmental data become even more sparse.
Age at follow-up 0-5 months 6-11 months 12-23 months Collaborative Bayley Scales of Perinatal Project, Infant USA Development Michigan, USA Bayley Fagan Test of Scales of Infant Infant Intelligence Development Brazelton Neonatal Behavioral Assessment Scale (NBAS) North Carolina, USA Bayley Bayley Scales of Bayley Scales Scales of Infant of Infant Infant Development Development Development Brazelton Neonatal Behavioral Assessment Scale (NBAS) Pregnancy, Infection, Mullen Scales of and Nutrition Babies Early Learning; Study, USA MacArthur-Bates Communicative Development Indices Oswego, USA Brazelton Fagan Test of Fagan Test of Neonatal Infant Infant Behavioral Intelligence Intelligence Assessment Scale (NBAS) Nunavik, Canada Dusseldorf, Germany Bayley Scales of Bayley Scales Infant of Infant Development Development Fagan Test of Infant Intelligence Duisburg, Germany Neurologic Bayley Scales Optimality of Infant Score Development (Prechtl, Neurologic Hempel) Optimality Score (Prechtl, Hempel) Faroe Islands 1 Faroe Islands 2 Neurologic Optimality Score (Prechtl, Hempel) Groningen/Rotterdam, Bayley Bayley Scales of Bayley Scales the Netherlands Scales of Infant of Infant Infant Development Development Development Neurologic Neurologic Optimality Optimality Score (Prechtl, Score Hempel) (Prechtl, Hempel) Hokkaido, Japan Bayley Scales of Infant Development Age at follow-up 24-35 36-47 months 48-59 months months Collaborative Stanford-Binet Perinatal Project, IQ test USA Michigan, USA McCarthy Scales of Children's Abilities Miscellaneous tests including Peabody, Beery, Kagan, Sternberg, Wide Range Achievement Test, individual subsets of Wechsler Intelligence Scale for Children (WISC), Bender Visual Motor Gestault Test, Boston Naming Test, etc. North Carolina, USA Bayley McCarthy Scales McCarthy Scales Scales of of Children's of Children's Infant Abilities Abilities Development Pregnancy, Infection, and Nutrition Babies Study, USA Oswego, USA McCarthy Scales McCarthy Scales of Children's of Children's Abilities Abilities MCC Michigan Catch the Cat Nunavik, Canada Multiple neuromotor tests Dusseldorf, Germany Bayley Kaufman Scales of Assessment Infant Battery for Development Children Duisburg, Germany Bayley Scales of Infant Development Faroe Islands 1 Faroe Islands 2 Groningen/Rotterdam, Neurologic the Netherlands Optimality Score (Prechtl, Hempel) Reynell Developmental Language Scale Kaufman Assessment Battery for Children Hokkaido, Japan Age at follow-up 60-71 72-83 months 84-95 months months Collaborative Miscellaneous Perinatal Project, tests including USA Peabody, Beery, Kagan, Sternberg, Wide Range Achievement Test, individual subsets of Wechsler Intelligence Scale for Children (WISC), Bender Visual Motor Gestault Test, Boston Naming Test, etc. Michigan, USA North Carolina, USA McCarthy Scales of Children's Abilities Pregnancy, Infection, and Nutrition Babies Study, USA Oswego, USA Nunavik, Canada Multiple Multiple neuromotor neuromotor tests tests Dusseldorf, Germany Kaufman Assessment Battery for Children Duisburg, Germany Faroe Islands 1 Miscellaneous tests including Peabody, Beery, Kagan, Sternberg, Wide Range Achievement Test, individual subsets of Wechsler Intelligence Scale for Children (WISC), Bender Visual Motor Gestault Test, Boston Naming Test, etc. Faroe Islands 2 Groningen/Rotterdam, McCarthy Scales Miscellaneous the Netherlands of Children's tests including Abilities Peabody, Beery, Kagan, Sternberg, Wide Range Achievement Test, individual subsets of Wechsler Intelligence Scale for Children (WISC), Bender Visual Motor Gestault Test, Boston Naming Test, etc. Hokkaido, Japan Figure 1. Summary of tests used in the PCB cohorts by age (for children up to 7 years of age at follow-up), Collaborative Perinatal Project, USA (Daniels et al. 2003); Michigan, USA (Jacobson et al, 1984); North Carolina, USA (Rogan et al, 1986); Pregnancy, Infection, and Nutrition Babies Study, USA, (Pan et al. 2009); Oswego, USA (Lonky et al. 1996); Nunavik, Canada (Despres et al. 2005); Dusseldorf, Germany (Winneke et al. 1998); Duisburg, Germany (Wilhelm et al. 2008); Faroe Islands 1 (Grandjean et al. 2001); Faroe Islands 2 (Steuerwald et al. 2000); Groningen/Rotterdam, the Netherlands (Huisman et al. 1995a); Hokkaido, Japan (Nakajima et al. 2006). Additional testing was conducted with the Oswego cohort at 8 and 9 years of age, the Netherlands cohort at 9 years of age, and the Michigan cohort at 11 years of age. Note: Table made from bar graph.
Feasibility of quantitative analysis. Our review of each cohort summarized in Figure 1 showed that the opportunities for a WOE review and/or meta-analysis of studies that used the same tests among children of the same or similar age appear to be most promising in the first and the fifth years of life. As noted in "Methods," our goal was to identify reasonably homogeneous groups of at least three studies. Studies were considered eligible for a meta-analysis if a) similar tests were administered at similar ages, b) exposure was measured and reported in comparable ways, c) results represented comparable measures of effect, and d) for the purposes of weighting in a meta-analysis, measures of effect were accompanied by corresponding measures of variance.
The earliest opportunity to assess consistency of findings across studies in terms of participants' age was presented in the neonatal period (i.e., within 28 days postpartum). Studies conducted in the United States and in Europe used different types of testing to examine neurobehavioral function in newborns. As shown in Table 1, the three U.S. studies included the Michigan cohort (Jacobson et al, 1984), the Oswego cohort (Lonky et al. 1996; Stewart et al. 2000), and the North Carolina cohort (Rogan et al. 1986). All of the U.S. studies administered Brazelton's Neonatal Behavioral Assessment Scale (NBAS), which was divided into seven clusters. Six of those clusters--response decrement, orientation, tonicity, range of state, regulation of state, and autonomic maturity--are considered behavioral. One cluster--reflex--is aimed at evaluating neurologic function.
Table 1. U.S. studies of neurologic and behavioral function among newborns using the Brazelton NBAS (a) in relation to measures of prenatal/early-life PCB exposure. Reference, Age at test Exposure Tests measuring cohort administration measure association Jacobson et 3 days Fish Chi-square test where al. 1984, consumption as outcome is Michigan, surrogate for dichotomized as USA PCB exposure normal vs. worrisome and exposure is defined as none vs. high Partial correlations adjusted (one at a time) for birth weight, head circumference, gestational age, and BEFM Multiple linear regression, adjusted for birth weight, head circumference, gestational age Cord serum PCBs Linear F-test (congeners not adjusted for birth specified) weight, head circumference, gestational age, and BEFM Rogan et 1-3 weeks PCB in breast Multiple linear al. 1986, milk fat regression, adjusted North (congeners not (simultaneously) for Carolina, specified) maternal age, USA education, occupation, smoking, drinking, fish intake, and anesthesia; infant's race, sex, birth weight, jaundice, age of testing, interval between testing and feeding Lonky et 1 day and 2 Fish MANCOVA with al. 1996, days consumption as principal components Oswego, New surrogate for analysis; outcome: York, USA PCB exposure the difference (b) between two assessments; exposure: three fish s consumption groups (high, low, and none) Stewart et 1 day and 2 [summation]PCB Linear F-test for al. 2000, days (69 congeners) each age (1 day and 2 Oswego, New in cord blood; days) analyzed York, USA lightly, separately across moderately, and four categories of highly highly chlorinated chlorinated PCB PCBs (nondetectable, congeners in low, medium, and cord blood; high) adjusted for fish unspecified consumption as covariates and three surrogate for fish consumption PCB exposure groups (high, low, and none); linear F-test across exposure categories where outcome is defined as percentage of poor NBAS scores Reference, cohort Results Jacobson et al. 1984, Chi-square test with level of significance Michigan, USA (e.g., < 0.1, < 0.05) Partial correlations (with level of significance, but without variance) reported for lability of states (a subset of RaS), RaS, AM, and R; other clusters not reported Regression coefficients (with level of significance, but without variance) reported for lability of states, AM and R; other clusters not reported For cord blood, text indicates no significant association Rogan et al. 1986, Results reported as significant for T and R North Carolina, (regression coefficients not reported); USA results for other clusters not reported, but text notes significant associations were found only for T and R and that for RD p-value was 0.07 Lonky et al. 1996, F-values with levels of significance Oswego, New York, (Bonferroni adjusted) reported for six USA (b) clusters, all except RD; direction/magnitude of difference reported for R, AM, and RD across three levels of fish consumption (but without variance) Stewart et al. 2000, Results for total [summation]PCB and lightly Oswego, New York, and moderately chlorinated PCBs not reported; USA results for highly chlorinated PCBs at 1 day noted as showing no differences; results for highly chlorinated PCB at 2 days given as linear term (F-statistic) for R, AM, and RD with either p-values or levels of significance; results for percentage of poor NBAS scores given as F-test and a p-value for highly chlorinated cord blood PCB and fish consumption; text indicates that lightly and moderately chlorinated PCB were not related to percentage of poor NBAS scores BEFM, Ballard Examination for Fetal Maturity. (a) Brazelton NBAS clusters: response decrement (RD), orientation, tonicity (T), range of state (RaS), regulation of state (ReS), autonomic maturity (AM), and reflex (R). (b) NBAS seven clusters administered twice: at 12-25 and 25-48 hr of life.
The Michigan and Oswego cohorts were given the NBAS test within the first 3 days of life. Both studies carried out multivariate analyses to link fish consumption (as a surrogate for exposure to PCBs) to NBAS score; however, the outcome definitions for the two cohorts differed from each other. Specifically, the Michigan study (Jacobson et al. 1984) used a single NBAS result obtained on the third day of life, whereas the Oswego study (Lonky et al. 1996) defined the outcome as the difference between two assessments conducted in the second and the first day after birth. Further, the multivariate analyses in the two studies [linear regression for the Michigan study and multivariate analysis of covariance (MANCOVA) for the Oswego study] produced results that could not be compared and/or combined quantitatively. A second publication based on the Oswego cohort (Stewart et al. 2000) examined the association between NBAS and cord blood PCBs in addition to fish consumption, which was already assessed by Lonky et al. (1996). The exposure was assessed using four metrics (total PCBs, lightly chlorinated PCBs, moderately chlorinated PCBs, and highly chlorinated PCBs), and the outcomes in this study were assessed separately at each time interval (1 day and 2 days of life) both as the NBAS score for each cluster and as an overall proportion of poor scores. The data were analyzed using a test for trend statistic; however, the quantitative results were reported only for the second day of life assessment and only for highly chlorinated PCBs.
The third study (the North Carolina cohort) that administered NBAS did so between the first and third week of life (Rogan et al. 1986). The analytic approach (linear regression) used in the North Carolina study was similar to that of the Michigan study (Jacobson et al. 1984), but the exposure measures differed (PCBs measured in breast milk were compared with estimates of PCB exposure based on fish consumption information or cord blood levels). In addition, the results were presented in terms of p-values without reporting the regression coefficients. Thus, despite the consistent use of NBAS in the first week of life by these three cohorts, differences in methods for estimating exposures and in reporting of outcomes preclude conducting a quantitative systematic review across the cohorts. It is worth noting that even if the statistical method had been consistent across studies, the differences in choice of covariates would still have rendered it very difficult to synthesize the effect sizes across studies (Lipsey 2001).
The three European studies of neonatal outcomes (Table 2) were conducted in Duisburg, Germany (Wilhelm et al. 2008), the Netherlands (Huisman et al. 1995a), and the Faroe Islands (Steuerwald et al. 2000). All three studies used the NOS, a combined measure that consists of 60 components with an optimal range of results predefined for each item, with the final score calculated as the total number of optimal items (Touwen et al. 1980).
Table 2. European studies that examined neurologic and behavioral function among newborns using the NOS in relation to measures of prenatal or early-life PCB exposure. Reference, Age at test Exposure Tests measuring cohort administration measure association Huisman et 10-21 days Individual, Logistic regression al. 1995a, [summation], and with outcome the TEQs: planar dichotomized as Netherlands PCBs (3 optimal vs. congeners) and nonoptimal and nonplanar PCBs results expressed (up to 23 as odds ratios for congeners) in doubling of breast milk fat; exposure adjusted individual and for maternal age, [summation]PCBs study center, (4 congeners) in alcohol use maternal plasma and cord plasma Steuerwald 2 weeks [summation]PCBs Linear regression et al. 2000, (three where exposure and Faroe congeners) in outcome were Islands breast milk, expressed as lipid-adjusted continuous [summation]PCBs variables (3 congeners) in controlling for Hg maternal serum and other (not specified) covariates Wilhelm et 2 weeks PCDD/F + PCB Linear regression al. 2008, TEQs (congeners presented as MD, Duisburg, not specified) where MD--1 = Germany in maternal [beta] for doubling blood and breast of exposure milk lipid adjusted for maternal age, gestational age, birth weight, alcohol intake, age of examination, maternal Pb and Hg exposure, and examiner Reference, cohort Results Huisman et al. 1995a, Reported as odds ratios and corresponding the Netherlands 95% confidence intervals Steuerwald et al. 2000, Text indicates weak positive association; Faroe Islands regression results not reported, Spearman's correlation coefficients reported (but without variance) Wilhelm et al. 2008, Reported as linear regression coefficients Duisburg, Germany and corresponding 95% confidence intervals Abbreviations: Hg, mercury; MD, mean differences; Pb, lead.
All three of these studies conducted the NOS assessment between 1 and 3 weeks of life and in that respect are comparable to the North Carolina cohort. Two of the three European studies--the Duisburg (Wilhelm et al. 2008) and Faroe Islands (Steuerwald et al. 2000) cohorts--performed linear regression analyses to examine the relation of NOS scores at 2 weeks of life to PCB levels in both milk and maternal blood samples; however, quantitative results are given only for the Duisburg cohort (Wilhelm et al. 2008). In addition, different analytes were selected for exposure assessment in these two studies. Wilhelm et al. (2008) examined PCBs together with polychlorinated dibenzo-p-dioxins and dibenzofurans (PCDD/Fs), whereas Steuerwald et al. (2000) expressed exposure as [summation]PCB (the sum of PCB congeners), the Netherlands cohort (Huisman et al. 1995a) dichotomized the NOS using the median PCB concentration in the study population as the cutoff. For the resulting binary outcome in the logistic regression analyses, the independent variable of interest was the log-transformed [summation]PCB and various [summation]PCB subsets (e.g., planar versus nonplanar). Again, despite the availability of three studies using the same neurodevelopmental test, differences in methods for estimating exposures and in reporting of outcomes preclude conducting a quantitative systematic review.
Six cohort studies used the same test--BSID--to assess the cognitive function of their participants between 5 and 8 months of age and thus could provide comparable data (Table 3). Three of these studies were conducted in the United States. The Michigan and the North Carolina cohort studies (Gladen et al. 1988; Jacobson et al. 1986) have been discussed previously in the context of neonatal assessment. The third U.S. study (Daniels et al. 2003) represents a multicenter effort--called the Collaborative Perinatal Project--that recruited participants from several sites (Baltimore, MD; Boston, MA; Buffalo, NY; Memphis, TN; Minneapolis, MN; New Orleans, LA; New York, NY; Philadelphia, PA; Portland, OR; Providence, RI; Richmond, VA). Among the three European studies shown in Table 3, two were carried out using the same cohort in Dusseldorf, Germany (Walkowiak et al. 2001; Winneke et al. 1998), and one was conducted using a subset of the previously discussed cohort of children from the Netherlands (Koopman-Esseboom et al. 1996). One additional study in this category was performed with a cohort of children from Sapporo, Japan (the Hokkaido Study on Environment and Children's Health) (Nakajima et al. 2006).
Table 3. Studies using MDI and PDI (BSID) at 5-8 months of age in relation to measures of prenatal or early-life PCB exposure. Reference, cohort Age at test Exposure measure administration Jacobson et al. 5 months Fish consumption as 1986, Michigan, surrogate for PCB exposure; USA cord blood PCBs (congeners not specified); breast milk PCBs (congeners not specified) Gladen et al. 1988, 6 months PCB in breast milk fat North Carolina, (congeners not specified) at USA birth used as measure of transplacental exposure; estimated cumulative exposure to PCB in breast milk (from birth to the age of test) Koopman-Esseboom et 7 months Prenatal exposure; al. 1996, the [summation]PCBs (24 Netherlands congeners) in maternal blood drawn in the last month of pregnancy; postnatal exposure: [summation]PCBs (4 congeners and 24 congeners) in breast milk Winneke et al. 7 months [summation]PCBs (three 1998, (a) congeners) in cord blood and Dusseldorf, breast milk Germany Walkowiak et al. 7 months [summation]PCBs (three 2001, (a) congeners) in breast milk Dusseldorf, and cord serum Germany Daniels et al. 8 months [summation]PCBs (11 2003, Collaborative congeners) in maternal blood Perinatal Project, drawn in the third USA trimester Nakajima et al. 6 months [summation]PCBs (total 14 2006, Hokkaido, congeners and by different Japan categories of congeners) and TEQs in maternal blood drawn in the third trimester of pregnancy or after delivery Reference, cohort Tests measuring Results association Jacobson et al. Linear regression and Text indicates that no 1986, Michigan, ANCOVA; details not measures of exposure USA reported were related to MDI or PDI; quantitative results not provided Gladen et al. 1988, Multiple linear Both MDI and PDI North Carolina, regression per 1 ppm, results reported as USA adjusted for maternal regression age, race, education, coefficients, standard occupation, smoking, errors, and two-sided drinking, infant's p-values sex, gestational age, birth weight, head circumference, jaundice, duration of breast-feeding, number of older siblings, abnormal reflexes on NBAS age of testing, and center/examiner Koopman-Esseboom et Multiple linear Result for MDI al. 1996, the regression expressed reported as regression Netherlands per In(ng/g) adjusted coefficients, standard for gestational age, errors, and two-sided parity, HOME score, p-values; results for and duration of PDI not reported breast-feeding Winneke et al. Multiple linear Both MDI and PDI 1998, (a) regression expressed results for both cord Dusseldorf, per ng/g adjusted for blood and breast milk Germany maternal age, PCB levels reported as education, vocabulary, regression smoking and drinking, coefficients, standard duration of errors, and one-sided breast-feeding, birth p-values weight, HOME score, Apgar score, cord blood lead level, and neonatal illness Walkowiak et al. Multiple linear For breast milk PCBs, 2001, (a) regression per both MDI and PDI Dusseldorf, [log.sub.2](ng/g) results reported as Germany adjusted for parity, regression smoking, and body mass coefficients, standard index errors, and one-sided p-values; for cord blood PCBs, text indicates that associations were small or positive; quantitative results not provided Daniels et al. Multiple linear Both MDI and PDI 2003, Collaborative regression per [mu]g/L results reported as Perinatal Project, adjusted for research regression USA center, maternal coefficients, standard education, errors, and two-sided triglycerides, p-values cholesterol, and birth order of the child Nakajima et al. Multiple linear Both MDI and PDI 2006, Hokkaido, regression per results reported as Japan In(ng/g) or per TEQ regression adjusted for coefficients and gestational age, two-sided p-values smoking and caffeine intake during pregnancy, and blood-sampling time HOME, Home Observation for Measurement of the Environment. (a) Two studies based on the same cohort; Winneke et al. (1998) reported BSID results at 7 months of age only, whereas Walkowiak et al. (2001) evaluated the same subjects again at 18 and 30 months of age and reported results at all three ages.
The versions of the BSID assessment used in these studies included two main scores: the Mental Development Index (MDI) and the Psychomotor Development Index (PDI). As shown in Table 3, all studies evaluating the relation between BSID in the first year of life and PCB exposure used linear regression to estimate the effect. However, the reporting and interpretation of the linear regression coefficients differed across the studies. Although we identified four studies that examined the relationship between MDI and PCB concentrations in maternal or cord blood, the results in these studies represented different measures of effect. In two of the four studies, the regression coefficients represented change in MDI per unit of PCB increase (micrograms per liter or nanograms per gram) (Daniels et al. 2003; Winneke et al. 1998); in two other studies (Koopman-Esseboom et al. 1996; Nakajima et al. 2006), the corresponding coefficients represented change in MDI per natural logarithm of exposure. Similarly, among the three studies that reported the association between MDI and PCBs in breast milk, only two (Gladen et al. 1988; Winneke et al. 1998) reported their findings as comparable regression coefficients per 1 ppm or 1 ng/g of exposure; the third study (Jacobson et al. 1986) simply noted a lack of association. Another publication that evaluated the relation between breast milk PCB levels and MDI (Walkowiak et al. 2001) used the same German cohort data as used by Winneke et al. (1998) but reported linear regression coefficients per logarithm base 2 of exposure. The results of studies for PDI at 5-8 months of life provided even less comparable information. Among the seven publications (based on six different cohort studies), only five calculated and reported regression coefficients, and only four of those studies were based on independent data. As was the case with MDI, it was not possible to identify three independent studies that could be combined in a meta-analysis because of the variability of exposure characterization and/or methods of expressing study results. Overall, even with the use of the Bayley Scales across several cohorts at similar times in life, a quantitative systematic review across cohorts was not possible.
Three cohort studies (Michigan, Oswego, and Dusseldorf) evaluated their participants at 6-7 months of age using the Fagan Test of Infant Intelligence. Although all three studies measured PCB cord blood concentrations (among other metrics), the specific congeners were different. Moreover, the association between exposure and outcome was assessed using different statistical methods: multiple linear regression that used PCB levels as a continuous variable in two studies (Jacobson et al. 1985 and Winneke et al. 1998) and an F-test for trend that used a four-level cord blood PCB categorization in the third study (Darvill et al. 2000). All three analyses appear to have controlled for different sets of confounders. Thus, the studies that administered the Fagan test were as heterogeneous as the studies that used BSID at roughly the same age.
The only remaining opportunity to assess the feasibility of conducting a meta-analysis was in a group of studies assessing cognitive function during the fifth year of life. As shown in Figure 1, three cohort studies in the United States (Michigan, Oswego, and North Carolina) evaluated the cognitive function of their participants between the fourth and the fifth birthdays using McCarthy Scales of Children's Abilities and were considered as candidates for inclusion in a meta-analysis (Gladen and Rogan 1991; Jacobson et al. 1990; Stewart et al. 2003). All three of these studies reported the results for the General Cognitive Index (GCI) of the McCarthy Scales. Only one of these studies also presented the results separately for the Verbal, Quantitative, Perceptual-Performance Memory, and Motor Scales (Jacobson et al. 1990).
Table 4 summarizes results for the three studies evaluating the association between perinatal PCB exposure and GCI. It is evident that despite testing the same hypothesis, the differences across the three studies were too pronounced to allow meaningful conclusions about the presence or absence of consistency in findings. Specifically, although the Michigan study (Jacobson et al. 1990) conducted linear regression analyses for cord blood and breast milk PCB exposures, only cord blood results were provided in their publication. The North Carolina study (Gladen and Rogan 1991) used breast milk concentrations to estimate exposure, but the data were analyzed using ANCOVA procedures, and the quantitative results were not reported. The Oswego cohort study (Stewart et al. 2003) was similar to the Michigan study in that they both estimated exposure based on PCB concentration in cord blood. However, unlike the Michigan study, the Oswego findings were presented not as regression coefficients but as linear F-test results, which divided exposure into four ordinal categories. As with our other attempts at a systematic review across cohorts, there was insufficient consistency with exposure measures and outcome reporting to conduct such a review.
Table 4. Studies using the GCI of the McCarthy Scales of Children's Abilities in the fifth year of age in relation to measures of prenatal or early-life PCB exposure. Reference, Age at test Exposure Tests measuring cohort administration measure association Jacobson et 4 years Cord blood PCBs Multiple linear al. 1990, (congeners not regression per Michigan, specified) [log.sub.x](ng/mL) USA +1 adjusted for maternal age, gravidity, and examiner (a) Breast milk PCBs Details of linear (congeners not regression modeling specified) for breast milk exposure not reported Gladen and 4-5 years PCB (congeners ANCOVA by dividing Rogan 1991, not specified) transplacental North in breast milk exposure into eight Carolina, fat at birth categories and USA used as a cumulative breast measure of milk exposure into transplacental five categories, exposure; adjusted for estimated maternal age, race, cumulative education, exposure to PCB occupation, in breast milk smoking, drinking, (from birth to infant's sex, the age of number of older test) siblings, feeding pattern Stewart et 4.5 years [summation]PCBs Linear F-test al. 2003, in cord blood across four PCB Oswego, New (17 congeners) exposure categories York, USA (nondetectable, low, medium, and high) adjusted for various combinations of (not specified) covariates Reference, cohort Results Jacobson et al. 1990, PCB result reported as a regression Michigan, USA coefficient with a two-sided p-value PCB regression; results not provided Gladen and Rogan 1991, For transplacental PCB exposure text indicates North Carolina, USA no association, quantitative results not provided; for cumulative breast milk exposure text indicates some variation but the pattern "did not suggest cause and effect"; quantitative results not provided Stewart et al. 2003, Result reported as a linear term without a Oswego, New York, USA measure of variance (all p-values marked as nonsignificant) (a) Details of log transformation are not given.
Discussion
Despite the relatively large body of literature on potential associations between early-life exposure to PCBs and adverse neurodevelopmental effects, controversy still exists over whether PCBs are in fact neurotoxicants, and to date, the U.S. Environmental Protection Agency has not established regulatory guidance values for PCBs based on neurotoxicity. Such regulatory decision making generally relies on a WOE assessment of studies, which in turn requires comparability across studies. Unfortunately, our examination of the PCB neurodevelopmental epidemiology literature found a lack of interstudy consistency. Even for age intervals examined by several research groups, presumably testing the same hypothesis, a meta-analysis of PCB studies is not possible at this time. Moreover, the frequency of evaluations decreased substantially and the data became increasingly sparse as the cohorts became older. This likely presents a missed research opportunity because testing in older children may be more reliable, and perhaps more informative with respect to the long-term prognosis (Sattler 2008).
As noted above, it is not the purpose of this review to weigh in on the ongoing debate over neurodevelopmental effects of PCBs, but rather to use the PCB neurodevelopmental epidemiology literature as the basis for describing generalizable issues related to interstudy consistency. Replication of findings, often referred to as "repeating a study," is a crucial aspect of the scientific method. Ability to repeat or reproduce a result leads to generalizable inferences, rather than merely to isolated and uncertain findings (Lindsay and Ehrenberg 1993). In the field of medical research, there is consensus that replication (or other substantiation) of clinical trials is a requirement for approval of drugs and medical devices (Berlin and Colditz 1999). Unlike testing of drugs and devices, most data generated by environmental research involving human subjects are observational in nature, and thus the conditions within a study are far less controlled. As noted in a recent review (Bellinger 2009), researchers conducting observational studies have great latitude in how exposure and outcome are measured and expressed, which methods for examining associations are employed, and which analyses among the myriad typically conducted ate reported. In this regard, the epidemiologic studies that rely on neurodevelopmental function test results as the end points of interest may be particularly affected by variability of study methods and reporting. This is attributable to the large number of available test batteries, each of which can offer different combinations of subtests (Youngstrom et al. 2010). Even within subtests, there are different scales and cutoff points for categorizing responses. If these are used and reported selectively, it can be very difficult to determine whether two different studies have demonstrated similar or conflicting results or have assessed overlapping but slightly different functions (Garabrant 2000).
Although consistency in study methods and reporting is a critical prerequisite of any WOE review, it is important to stress that consistency of methods alone is not sufficient for drawing conclusions about causation. By combining several studies, meta-analyses have an inherent ability to detect relatively small statistically significant departures from null. However, these relatively precise meta-estimates may not accurately reflect the true association unless the analyses take into consideration potential sources of systematic error that may affect reviews of the literature. One source of error that warrants consideration in any systematic review is publication bias, which can occur because studies with statistically significant positive findings are more likely to be published than are studies with null results. Publication bias has been shown to be of particular importance in observational studies (Easterbrook et al. 1991). Another closely related concept is selective reporting bias within published studies and is defined as ''selection on the basis of the results of a subset of the original variables recorded for inclusion in a publication" (Dwan et al. 2008). Consider, for instance, the Netherlands cohort study that administered neurologic testing and calculated the NOS at two different ages: 10-21 days and 18 months of age (Huisman et al. 1995a, 1995b). Although the two follow-ups tested the same hypothesis, the two statistical analyses were markedly different: logistic regression at 10-21 days and linear regression at 18 months of age. Perhaps more important, the strongest inverse association (between nonplanar PCBs and NOS) observed in newborns does not seem to have been reexamined (or at least not reported) in the 18-month-olds.
The search for sources of error in any systematic review inevitably leads to evaluation of individual study quality. Issues that need to be addressed usually include magnitude of nonparticipation or loss to follow-up, misclassification of exposure and/or outcome, and ability to control for extraneous factors, all of which may introduce bias. For example, an important method of minimizing information bias is making sure that persons administering the test (and at a later age perhaps also subjects themselves) are unaware of the participants' exposure status. Among studies summarized in this review (Tables 1-4), many indicated that they implemented blinding; however, in two instances (Gladen et al. 1988; Huisman et al. 1995a) the investigators were unaware of the results of laboratory analyses but knew which children were breast-fed; this information was used in estimating PCB exposure. In addition, several studies did not mention blinding procedures in their respective methods sections (Daniels et al. 2003; Gladen and Rogan 1991; Jacobson et al. 1986; Stewart et al. 2003; Wilhelm et al. 2008).
In the absence of comparable published information, one potential method for assessing the consistency of findings across studies would be to obtain the original data and then either compare the results using the same statistical methods or combine the data in a pooled analysis. For example, pooling of the data might be helpful in bringing together the three studies (Koopman-Esseboom et al. 1996; Daniels et al. 2003; Nakajima et al. 2006) that examined the association between prenatal maternal blood levels of PCBs and BSID scores but focused on different sets of congeners, used different modeling approaches, and controlled for different covariates. Such pooled analyses would be possible for only some of the many associations examined to date and would, of course, require the cooperation of researchers and depend on their willingness to share data. Perhaps more important, future studies of chemical exposures and neurodevelopmental outcomes must build on previous research with the aim of facilitating WOE assessments. Repeated calls for establishing consensus standards for the conduct, analysis, and reporting of epidemiologic studies have been voiced in a variety of areas of research, including those related to the effects of neurotoxicant exposures (Bellinger 2009).
WOE assessment is essential to interpreting results of epidemiology studies of neurodevelopment and chemical exposure. Yet even for chemicals that have been studied for their neurotoxicity for decades, there is still controversy over whether WOE is sufficient to state unequivocally that they are neurotoxicants, or to define the dose-response relationship. We used PCBs as a case study to highlight the need for improved inter-and intrastudy consistency in the selection of neurodevelopment function tests and domains to be evaluated, exposure assessment, and/or method of analyzing/reporting data.
Conclusions
We conclude with the following recommendations: First, although novel approaches for assessing neurodevelopment will continue to be developed and should be used, it is important that future research include measures comparable to those used by past researchers. The lack of inclusion of comparable measures will hinder our ability to conduct WOE assessments. We recommend that key individuals and international organizations determine and establish the specific comparable measures that should be included in each study. This is not intended to be a prescriptive list that would limit future investigators' novel approaches, but rather a methodologic feature that would permit future evaluations by scientists and regulators.
Similarly, future investigators will likely have new tools (or a favored tool) for assessing exposure to environmental chemicals. These include traditional exposure assessments, biomonitoring, and use of biomarkers of exposure. A standard, baseline metric of exposure should be derived that is evaluated as a minimum exposure metric for all studies (other types of exposure assessments could be conducted in addition to this baseline metric) to again allow for interstudy comparisons.
Third, although efforts are being made within certain agencies (e.g., the National Institutes of Health) to requite sharing of raw data, a broader effort is needed to ensure that study data are available for WOE assessments. This will not occur without in-place requirements (i.e., agency-required data sharing) as part of research-funding mechanisms.
In addition, selection of statistical methods for analyzing data from complex data sets has been the subject of intense and sometimes acrimonious debate (Kimbrough and Krouskas 2003). To this end, we recommend that an expert panel composed of statisticians, neurologists, psychologists, psychometricians, epidemiologists, and exposure and risk assessors from academia and government who have not been part of past environmental neurodevelopmental epidemiology studies (and can therefore bring fresh perspectives) be convened to discuss and recommend best practices.
Last, journals could facilitate progress by either accepting or requiring the archival of tables of summary statistics, such as unadjusted correlations, means, and standard deviations, perhaps augmented by a description of patterns of missing data. Some publication manuals, style guides, and other guidelines recommend the archiving of sufficient descriptive statistics to allow independent analyses of the data (American Psychological Association 2010). Techniques are available that would allow the inclusion of these summary tables in subsequent meta-analyses (Becker 1996), and they also would establish a "least common denominator" of data reporting that would still represent an advance over the current fragmented and hard to synthesize state of the literature.
We recognize that reaching agreement within the scientific community on the recommendations above will be difficult. However, we believe that without some consensus on each of these issues, out ability to truly evaluate neurodevelopmental risks associated with chemical exposures will not be possible.
REFERENCES
American Psychological Association. 2010. Publication Manual of the American Psychological Association. 6th ed. Washington, DC:American Psychological Association.
Amler RW, Barone S Jr, Belger A, Berlin CM Jr, Cox C, Frank H, et al. 2006. Hershey Medical Center Technical Workshop Report: optimizing the design and interpretation of epidemiologic studies for assessing neurodevelopmental effects from in utero chemical exposure. Neurotoxicology 27:861-874.
Becker G. 1996. The meta-analysis of factor analyses: an illustration based on the cumulation of correlation matrices. Psychol Method 1:341-353.
Bellinger DC. 2009. Interpreting epidemiologic studies of developmental neurotoxicity: conceptual and analytic issues, Neurotoxicol Teratol 31:267-274.
Berlin JA, Colditz GA. 1999. The role of meta-analysis in the regulatory process for foods, drugs, and devices. JAMA 281:830-834.
Boucher O, Muckle G, Bastien CH. 2009. Prenatal exposure to polychlorinated biphenyls: a neuropsychologic analysis. Environ Health Perspect 117:7-16.
Cicchetti DV, Kaufman AS, Sparrow SS. 2004. The relationship between prenatal and postnatal exposure to polychlorinated biphenyls (PCBs) and cognitive, neuropsychological, and behavioral deficits: a critical appraisal. Psychol Schools 41:589-624.
Daniels JL, Longnecker MP, Klebanoff MA, Gray KA, Brock. JW, Zhou H, et al. 2003. Prenatal exposure to low-level polychlorinated biphenyls in relation to mental and motor development at 8 months. Am J Epidemiol 157:485-492.
Darvill T, Lonky E, Reihman J, Stewart P, Pagano J. 2000. Prenatal exposure to PCBs and infant performance on the Fagan Test of Infant Intelligence. Neurotoxicology 21:1029-1038.
Despres C, Beuter A, Richer F, Poitras K, Veilleux A, Ayotte P, et al. 2005. Neuromotor functions in Inuit preschool children exposed to Pb, PCBs, and Hg. Neurotoxicol Teratol 27:245-257.
Dwan K, Altman DG, Arnaiz JA, Bloom J, Chan AW, Cronin E, et al. 2008. Systematic review of the empirical evidence of study publication bias and outcome reporting bias. PLoS One 3:e3081.
Easterbrook PJ, Berlin JA, Gopalan R, Matthews DR. 1991. Publication bias in clinical research. Lancet 337:867-872.
Garabrant DH. 2000. Epidemiologic principles in the evaluation of suspected neurotoxic disorders. Neurol Clin 18:631-648.
Gladen BC, Rogan WJ. 1991. Effects of perinatal polychlorinated biphenyls and dichlorodiphenyl dichloroethene on later development. J Pediatr 119:58-63.
Gladen BC, Rogan WJ, Hardy P, Thullen J, Tingelstad J, Tully M. 1988. Development after exposure to polychlorinated biphenyls and dichlorodiphenyl dichloroethene transplacentally and through human milk. J Pediatr 113:991-995.
Grandjean P, Weihe P, Burse VW, Needham LL, Storr-Hansen E, Heinzow B, et al. 2001. Neurobehavioral deficits associated with PCB in 7-year-old children prenatally exposed to seafood neurotoxicants. Neurotoxicol Teratol 23:305-317.
Huisman M, Koopman-Esseboom C, Fidler V, Hadders-Algra M, van der Paauw CG, Tuinstra LG, et al. 1995a. Perinatal exposure to polychlorinated biphenyls and dioxins and its effect on neonatal neurological development. Early Hum Dev 41:111-127.
Huisman M, Koopman-Esseboom C, Lanting CI, van der Paauw CG, Tuinstra LG, Fidler V, et al. 1995b. Neurological condition in 18-month-old children perinatally exposed to polychlorinated biphenyls and dioxins. Early Hum Dev 43:165-176.
Jacobson JL, Fein G, Schwartz PM, Dowler JK. 1984. Prenatal exposure to an environmental toxin: a test of the mutliple effects model. Dev Psychol 20:523-532.
Jacobson JL, Jacobson SW, Humphrey HE. 1990. Effects of in utero exposure to polychlorinated biphenyls and related contaminants on cognitive functioning in young children. J Pediatr 116:38-45.
Jacobson SW, Fein GG, Jacobson JL, Schwartz PM, Dowler JK. 1985. The effect of intrauterine PCB exposure on visual recognition memory. Child Dev 56:853-860.
Jacobson SW, Jacobson JL, Fein GG. 1986. Environmental toxins and infant development. In: Theory and Research in Behavioral Pediatrics (Fitzgerald HE, ed). New York:Plenum Press, 96-146.
Kimbrough RD, Krouskas CA. 2003. Human exposure to polychlorinated biphenyls and health effects: a critical synopsis. Toxicol Rev 22:217-233.
Koopman-Esseboom C, Weisglas-Kuperus N, de Ridder MA, Van der Paauw CG, Tuinstra LG, Sauer PJ. 1996. Effects of polychlorinated biphenyl/dioxin exposure and feeding type on infants' mental and psychomotor development. Pediatrics 97:700-706.
Lindsay RM, Ehrenberg ASC. 1993. The design of replicated studies. Am Stat 47:217-228.
Lipsey MW, Wilson DB. 2001. Practical Meta-Analysis. Thousand Oaks, CA:Sage Publications.
Lonky E, Reihman J, Darvill T, Mather J, Daly H. 1996. Neonatal Behavioral Assessment Scale performance in humans influenced by maternal consumption of environmentally contaminated Lake Ontario fish. J Great Lakes Res 22:98-212.
Nakajima S, Saijo Y, Kato S, Sasaki S, Uno A, Kanagami N, et al. 2006. Effects of prenatal exposure to polychlorinated biphenyls and dioxins on mental and motor development in Japanese children at 6 months of age. Environ Health Perspect 114:773-778.
Pan IJ, Daniels JL, Goldman BD, Herring AH, Siega-Riz AM, Rogan WJ. 2009. Lactational exposure to polychlorinated biphenyls, dichlorodiphenyltrichloroethane, and dichlorodiphenyldichloroethylene and infant neurodevelopment: an analysis of the Pregnancy, Infection, and Nutrition Babies Study. Environ Health Perspect 117:488-94.
Rogan WJ, Gladen BC, McKinney JD, Carreras N, Hardy P, Thullen J, et al. 1986. Neonatal effects of transplacental exposure to PCBs and DDE. J Pediatr 109:335-341.
Sattler J. 2008. Assessment of Children: Cognitive Foundations, 5th ed. San Diego:J Sattler.
Schantz SL, Widholm JJ, Rice DC. 2003. Effects of PCB exposure on neuropsychological function in children. Environ Health Perspect 111:357-576.
Steuerwald U, Weihe P, Jorgensen PJ, Bjerve K, Brock J, Heinzow B, et al. 2000. Maternal seafood diet, methylmercury exposure, and neonatal neurologic function. J Pediatr 136:599-605.
Stewart P, Reihman J, Lonky E, Darvill T, Pagano J. 2000. Prenatal PCB exposure and Neonatal Behavioral Assessment Scale (NBAS) performance. Neurotoxicol Teratol 22:21-29.
Stewart PW, Reihman J, Lonky El, Darvill TJ, Pagano J. 2003. Cognitive development in preschool children prenatally exposed to PCBs and MeHg. Neurotoxicol Teratol 25:11-22.
Touwen BC, Huisjes HJ, Jurgens-van der Zee AD, Biermanvan Eendenburg ME, Smrkovsky M, Olinga AA. 1980. Obstetrical condition and neonatal neurological morbidity. An analysis with the help of the optimality concept. Early Hum Dev 4:207-228.
Treadwell JR, Tregear SJ, Reston JT, Turkelson CM. 2006. A system for rating the stability and strength of medical evidence. BMC Med Res Methodol 6:52; doi:10.1186/1471-2288-6-52 [Online 19 October 2006].
Walkowiak J, Wiener JA, Fastabend A, Heinzow B, Kramer U, Schmidt E, et al. 2001. Environmental exposure to polychlorinated biphenyls and quality of the home environment: effects on psychodevelopment in early childhood. Lancet 358:1602-1607.
Weed DL. 2005. Weight of evidence: a review of concept and methods. Risk Anal 25:1545-1557.
Wigle DT, Arbuckle TE, Turner MC, Berube A, Yang Q, Liu S, et al. 2008. Epidemiologic evidence of relationships between reproductive and child health outcomes and environmental chemical contaminants. J Toxicol Environ Health B Crit Rev 11:373-517.
Wilhelm M, Wittsiepe J, Lemm F, Ranft U, Kramer U, Furst P, et al. 2008. The Duisburg birth cohort study: influence of the prenatal exposure to PCDD/Fs and dioxin-like PCBs on thyroid hormone status in newborns and neurodevelopment of infants until the age of 24 months. Mutat Res 659:83-92.
Winneke G, Boersma ER, Grandjean P, Kramer U, Steingruber HJ, Weisglas-Kuperus N. 2003. Outcome of early developmental PCB-exposure in 42-months-old children: results from the multicentric European cohort study: ISEE-243. Epidemiology 14:S46.
Winneke G, Bucholski A, Heinzow B, Kramer U, Schmidt E, Walkowiak J, et al. 1998. Developmental neurotoxicity of polychlorinated biphenyls (PCBs): cognitive and psychomotor functions in 7-month old children. Toxicol Lett 102-103:423-428.
Youngstrom E, Gutermuth Anthony L, LaKind JS, Kenworthy L, Lipkin PH, Goodman M, et al. 2010. Advancing the selection of neurodevelopmental measures in epidemiological studies of environmental chemical exposure and health effects. Int J Environ Res Public Health 7:229-268.
Address correspondence to J.S. LaKind, LaKind Associates, LLC, 106 Oakdale Ave., Catonsville, MD, USA 21228. Telephone: (410)788-8639. Fax: (410) 788-8639. E-mail; lakindassoc@comcast.net
We gracefully acknowledge the input and recommendations regarding interstudy consistency in neurodevelopmental epidemiologic research from panel members at a neurodevelopmental function testing workshop held in Baltimore, MD, in June 2009. In addition to the authors of this review, the panel consisted of A.S. Carter (University of Massachusetts, Boston), C. Einspieler (Medical University of Graz), T. Frazier II (Cleveland Clinic Children's Hospital), M. Gerdes (Children's Hospital of Philadelphia), M. Hadders-Algra (University Medical Center Groningen), W.E. Kaufmann and E.M. Mahone (Kennedy Krieger Institute), S.L. Makris (U.S. Environmental Protection Agency), and P. Thorsen (Rollins School of Public Health, Emory University).
This research was supported by a grant from Cefic-Long-range Research Initiative (LRI). Cefic-LRI was not involved in the design, collection, management, analysis, or interpretation of the data or in the preparation or approval of the manuscript.
Mention of trade names or commercial products does not constitute endorsement or recommendation for use. The findings and conclusions in this article are those of the authors and do not necessarily represent the views of Cefic-LRI or the National Institutes of Health.
J.S.L. consults to both government and industry. P.H.L. was a consultant to Bristol Myers Squibb from 2008 to 2009; he has no current actual or potential competing financial interests. E.Y. received travel funding from Otsuka/Bristol Myers Squibb in 2009. The other authors declare they have no actual or potential competing financial interests.
Received 16 December 2009; accepted 22 February 2010.
Michael Goodman, (1) Katherine Squibb, (2) Eric Youngstrom, (3), (4) Laura Gutermuth Anthony, (5), (6), (7) Lauren Kenworthy, (5), (6), (7), (8) Paul H. Lipkin, (9), (10) Donald R. Mattison, (11) and Judy S. LaKind (12), (13), (14)
(1) Department of Epidemiology, Emory University School of Public Health, Atlanta, Georgia, USA;
(2) Department of Medicine, University of Maryland School of Medicine, Baltimore, Maryland, USA;
(3) Department of Psychology, and
(4) Department of Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA;
(5) Center for Autism Spectrum Disorders, Children's National Medical Center, Washington, DC, USA;
(6) Department of Pediatrics,
(7) Department of Psychiatry, and
(8) Department of Neurology, George Washington University School of Medicine, Washington, DC, USA;
(9) Center for Development and Learning, Kennedy Krieger Institute, Baltimore, MD, USA;
(10) Department of Pediatrics, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA;
(11) Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Department of Health and Human Services, Bethesda, Maryland, USA;
(12) LaKind Associates, LLC, Catonsville, Maryland, USA;
(13) Department of Epidemiology and Preventive Medicine, University of Maryland School of Medicine, Baltimore, Maryland, USA;
(14) Department of Pediatrics, Milton S. Hershey Medical Center, Penn State College of Medicine, Hershey, Pennsylvania, USA
![]() ![]() ![]() ![]() | |
Title Annotation: | Review; polychlorinated biphenyls |
---|---|
Author: | Goodman, Michael; Squibb, Katherine; Youngstrom, Eric; Anthony, Laura Gutermuth; Kenworthy, Lauren; |
Publication: | Environmental Health Perspectives |
Article Type: | Report |
Geographic Code: | 1USA |
Date: | Jun 1, 2010 |
Words: | 9264 |
Previous Article: | The History, Use, Disposition and Environmental Fate of Agent Orange. |
Next Article: | Do TETRA (Airwave) base station signals have a short-term impact on health and well-being? A randomized double-blind provocation study. |
Topics: |