Discovery of novel biomarkers by microarray analysis of peripheral blood mononuclear cell gene expression in benzene-exposed workers.Benzene is an industrial chemical and component of gasoline that is an established cause of leukemia. To better understand the risk benzene poses, we examined the effect of benzene exposure on peripheral blood peripheral blood Cardiology Blood circulating in the system/body mononuclear mononuclear /mono·nu·cle·ar/ (-noo´kle-er) 1. having but one nucleus. 2. a cell having a single nucleus, especially a monocyte of the blood or tissues. mon·o·nu·cle·ar adj. cell (PBMC PBMC Peripheral Blood Mononuclear Cell ) gene expression in a population of shoe-factory workers with well-characterized occupational exposures using microarrays and real-time polymerase chain reaction In Molecular Biology, real-time polymerase chain reaction, also called quantitative real time polymerase chain reaction (QRT-PCR) or kinetic polymerase chain reaction (PCR PCR polymerase chain reaction. PCR abbr. polymerase chain reaction Polymerase chain reaction (PCR) ). PBMC RNA RNA: see nucleic acid. RNA in full ribonucleic acid One of the two main types of nucleic acid (the other being DNA), which functions in cellular protein synthesis in all living cells and replaces DNA as the carrier of genetic was stabilized in the field and analyzed using a comprehensive human array, the U133A/B A/B Airborne A/B Afterburner (jet engines) A/B Air Blast A/B Answerback A/B Auto-brake A/B Air Bus A/B Afterburning Affymetrix GeneChip set. A matched analysis of six exposed-control pairs was performed. A combination of robust multiarray analysis and ordering of genes using paired t-statistics, along with bootstrapping Bootstrapping A procedure used to calculate the zero coupon yield curve from market figures. Notes: Since the T-bills offered by the government are not available for every time period, the bootstrapping method is used to fill in the missing figures in order to derive the to control for a 5% familywise error rate In statistics, familywise error rate (FWER) is the probability of making one or more false discoveries, or type I errors among all the hypotheses when performing multiple pairwise tests[1][2]. , was used to identify differentially expressed genes in a global analysis. This resulted in a set of 29 known genes being identified that were highly likely to be differentially expressed. We also repeated these analyses on a smaller subset of 508 cytokine Cytokine Any of a group of soluble proteins that are released by a cell to send messages which are delivered to the same cell (autocrine), an adjacent cell (paracrine), or a distant cell (endocrine). probe sets and found that the expression of 19 known cytokine genes was significantly different between the exposed and the control subjects. Six genes were selected for confirmation by real-time PCR, and of these, CXCL16, ZNF ZNF Zinc Finger (protein; biology) 331, JUN, and PF4 were the most significantly affected by benzene exposure, a finding that was confirmed in a larger data set from 28 subjects. The altered expression was not caused by changes in the makeup of the PBMC fraction. Thus, microarray analysis along with real-time PCR confirmation reveals that altered expressions of CXCL16, ZNF331, JUN, and PF4 are potential biomarkers of benzene exposure. Key words: Affymetrix, benzene, biomarkers, blood, expression profiling Microarray technology is often used for gene expression profiling. It makes use of the sequence resources created by the genome sequencing projects and other sequencing efforts to answer the question, , leukemia, lymphocyte lymphocyte: see blood; immunity. lymphocyte Type of leukocyte fundamental to the immune system, regulating and participating in acquired immunity. Each has receptor molecules on its surface that bind to a specific antigen. , microarray, molecular epidemiology molecular epidemiology Molecular medicine An evolving field that combines the tools of standard epidemiology–case studies, questionnaires and monitoring of exposure to external factors with the tools of molecular biology–eg, restriction endonucleases, , occupational exposure, real-time PCR. Environ Health Perspect 113:801-807 (2005). doi:10.1289/ehp.7635 available via http://dx.doi.org/[Online 14 March 2005] ********** Benzene is an important industrial chemical (> 2 billion gallons produced annually in the United States United States, officially United States of America, republic (2005 est. pop. 295,734,000), 3,539,227 sq mi (9,166,598 sq km), North America. The United States is the world's third largest country in population and the fourth largest country in area. ) and component of gasoline (Gist and Burg 1997). Its toxic effects on the blood and bone marrow include leukopenia leukopenia /leu·ko·pe·nia/ (-pe´ne-ah) reduction of the number of leukocytes in the blood below about 5000 per cubic mm.leukope´nic basophilic leukopenia basophilopenia. , pancytopenia pancytopenia /pan·cy·to·pe·nia/ (-sit-ah-pe´ne-ah) abnormal depression of all the cellular elements of the blood. pan·cy·to·pe·ni·a n. , and aplastic anemia aplastic anemia or anemia of bone-marrow failure Inadequate blood-cell formation by bone marrow. Pancytopenia is the lack of all blood-cell types (erythrocytes, leukocytes, and platelets), but any combination may be missing. , and it is also an established cause of human leukemia (Snyder 2002). However, the mechanisms of benzene-induced hematotoxicity and leukemo-genesis remain unclear, as does the risk benzene poses at low levels of exposure (Krewski et al. 2000). To shed further light on these mechanisms and better understand the risk benzene poses, we examined the effects of benzene exposure on peripheral blood mononuclear cell (PBMC) gene expression in a population of shoe-factory workers with well-characterized occupational exposures to benzene using cDNA microarrays and real-time polymerase chain reaction (PCR). Microarrays use immobilized cDNA or oligonucleotide probes to simultaneously monitor the expression of thousands of genes and obtain a view of global gene expression (i.e., a view of all mRNA transcripts expressed by a cell is known as the transcriptome The transcriptome is the set of all messenger RNA (mRNA) molecules, or "transcripts", produced in one or a population of cells. The term can be applied to the total set of transcripts in a given organism, or to the specific subset of transcripts present in a particular cell type. ) (Staudt 2003; Staudt and Brown 2000) and are becoming increasingly used in toxicology (Hamadeh et al. 2002; Waters et al. 2003). They have also been used recently to investigate variation in gene expression in the peripheral blood leukocytes of normal individuals (Whitney et al. 2003). We hypothesized that microarrays could identify changes in gene expression that could be used as new biomarkers of exposure and early effect for benzene and provide information on mechanisms of benzene toxicity. One potential problem with using microarrays in epidemiologic studies is that mRNA is unstable (Thach et al. 2003). Most epidemiologic studies that have collected biologic samples have not collected material that contains stabilized RNA for analysis. Here, we have overcome this problem by performing the first step of RNA isolation in the field and stabilizing the RNA for later analysis. We have analyzed this RNA from selected subjects using a comprehensive and standardized human array, the U133A/B Affymetrix GeneChip set (Iacobuzio-Donahue et al. 2003). U133A and U133B chips together contain almost 45,000 probe sets, representing > 39,000 unique transcripts derived from approximately 33,000 well-substantiated human genes, allowing investigators to obtain a global view of gene expression. We performed a proof-of-principle study in which we examined global gene expression in a small number of well-matched exposed--control subject pairs. Genes with differential expression were then ranked and selected for further examination using several forms of statistical analysis. We also specifically examined the expression of all cytokine genes on the array under the a priori a priori In epistemology, knowledge that is independent of all particular experiences, as opposed to a posteriori (or empirical) knowledge, which derives from experience. hypothesis that these key genes involved in immune function Immune function The state in which the body recognizes foreign materials and is able to neutralize them before they can do any harm. Mentioned in: Herbalism, Traditional Chinese, Stress Reduction are likely to be altered by benzene exposure (Aoyama 1986). We then attempted to confirm the array findings for the leading differentially expressed genes using real-time PCR, which is thought to be more accurate than microarray analysis but can be used only to investigate a few genes at a time (Etienne et al. 2004). Once these genes were confirmed in the paired analysis, we examined their expression in a larger number of benzene-exposed subjects and controls. The overall goal is to provide potential gene markers of exposure and early effect for benzene and to produce mechanistic insight into how benzene affects the body, especially the immune system immune system Cells, cell products, organs, and structures of the body involved in the detection and destruction of foreign invaders, such as bacteria, viruses, and cancer cells. Immunity is based on the system's ability to launch a defense against such invaders. and lymphocyte function. Materials and Methods Study subjects and exposure assessment. We studied workers exposed to benzene in two shoe manufacturing factories and unexposed controls from three clothes manufacturing factories in the same region of Tianjin, China. The study was approved by institutional review boards at all institutions. Participation was voluntary, written informed consent was obtained, and the participation rate was approximately 95%. An initial group of six workers was selected from among the more highly exposed workers (mean benzene [+ or -] SD = 47.3 [+ or -] 24.3 ppm), and six controls were frequency-matched to these subjects on the basis of age and sex. Mean age was 33.7 [+ or -] 7.1 years for the six exposed workers and 31 [+ or -] 6.7 years for the controls. Four pairs were male and the other two female. Before phlebotomy Phlebotomy Definition Phlebotomy is the act of drawing or removing blood from the circulatory system through a cut (incision) or puncture in order to obtain a sample for analysis and diagnosis. , individual benzene and toluene toluene (tōl`y ēn') or methylbenzene (mĕth'əlbĕn`zēn), C7H8 exposure was
monitored by each wearing an organic vapor passive monitor badge as
previously described (Vermeulen et al. 2004). Personal full-shift air
monitoring was conducted about every month over a 3- to 4-month period
before biologic sample collection. Benzene and toluene were not detected
in air samples from the control factories.Each subject was given a physical exam by a study physician. A questionnaire was administered that requested detailed information on occupation, environmental exposures to solvents and pesticides, past and current tobacco and alcohol use, past and current medical history including recent infections, diagnostic and therapeutic ionizing radiation i·on·i·zing radiation n. High-energy radiation capable of producing ionization in substances through which it passes. Ionizing radiation exposure, medication use, family history, and a food frequency questionnaire developed for use in northern China. Biologic sample collection. Peripheral blood, buccal buc·cal adj. 1. Of, relating to, adjacent to, or in the direction of the cheek. 2. Of or relating to the mouth cavity. buccal cells, and urine were collected from each subject at the beginning of the workday around 0900 hr and were processed within 6 hr of collection. White blood cell differential counts white blood cell differential count See 'Diff.'. and the levels of natural killer (NK) cells, B lymphocytes, CD[4.sup.+] and CD[8.sup.+] T lymphocytes were determined. The PBMC fraction, consisting of lymphocytes Lymphocytes Small white blood cells that bear the major responsibility for carrying out the activities of the immune system; they number about 1 trillion. , monocytes monocytes, n.pl the largest of the white blood cells. They have one nucleus and a large amount of grayish-blue cytoplasm. Develop into macrophages and both consume foreign material and alert T cells to its presence. , and some platelets, was isolated in the field using Ficoll-Paque (Amersham, Piscataway, NJ). One to five million PBMCs were added to 1 mL RLT RLT Revocable Living Trust RLT Relating to RLT Real Life Test RLT Raleigh Little Theatre RLT Regimental Landing Team RLT Regional Leadership Team RLT Real Life Technologies (trademark of Hewlett-Packard) RLT Release Link Trunk buffer (Qiagen, Valencia, CA) containing 1% [beta]-mercaptoethanol to preserve RNA in the cells. RNA that is frozen in this buffer at -80[degrees]C is highly stable. RNA isolation, amplification, and hybridization hybridization /hy·brid·iza·tion/ (hi?brid-i-za´shun) 1. crossbreeding; the act or process of producing hybrids. 2. molecular hybridization 3. . We isolated total RNA using RNeasy mini kits (Qiagen) according to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. manufacturer instructions and quantified using a SmartSpec 3000 (Bio-Rad, Hercules, CA). Only samples with an A260/A280 between 1.7 and 2.2 were considered suitable for use. Samples were prepared according to the GeneChip Eukaryotic eukaryotic /eu·kary·ot·ic/ (u?kar-e-ot´ik) pertaining to a eukaryon or to a eukaryote. eukaryotic pertaining to eukaryosis. eukaryotic cells see cell. Small Sample Target Labeling Assay Version II (Affymetrix 2003a), with the exception that the GeneChip Sample Cleanup Module (Affymetrix, Santa Clara Santa Clara, city, Cuba Santa Clara (sän`tä klä`rä), city (1994 est. pop. 217,000), capital of Villa Clara prov., central Cuba. , CA) was used and not ethanol precipitation Ethanol precipitation is a method used to concentrate DNA. DNA is polar, soluble in water which is polar as well. Based on the principle of "like dissolves like", it is insoluble in the relatively less polar ethanol. . Total RNA (100 ng) was amplified for each sample, with 400 ng of first-round cRNA used for the second round of cDNA synthesis. Second-round cRNA (15 [micro]g) was fragmented in 30 [micro]L of 1x fragmentation buffer. Hybridization cocktails were made as described in the GeneChip Expression Analysis Technical Manual (Affymetrix 2003b) and hybridized to U133A chips at 60 rpm, 45[degrees]C. After 16 hr, the hybridization cocktails were removed, added back to the unused hybridization cocktails, and stored at -80[degrees]C. GeneChips were stained with streptavidin-phycoerythrin using the EukGE-WS2 protocol (Affymetrix 2003b). GeneChips were scanned twice using a GeneChip Scanner GA 2500 (Affymetrix). Frozen hybridization cocktails were heated to 65[degrees]C for 5 min and then applied to U133B chips as described for U133A chips [of the 45,000 probe sets, only 100 (which can be used for normalization In relational database management, a process that breaks down data into record groups for efficient processing. There are six stages. By the third stage (third normal form), data are identified only by the key field in their record. ) are found on both chips, so this "recycling" of hybridization cocktail should not affect the results]. Chips were then analyzed as described below. Chip normalization. To allow comparison, all chips were scaled to a target intensity of 500 based on all probe sets on each chip. Samples were run blind so that exposure status was unknown and designated as being either group A or B. Group A chips were used as the baselines when analyzing chips from group B. Statistical analysis to identify differentially expressed genes. Robust multiarray analysis (RMA (RealMedia Architecture) See RealMedia. ) (Irizarry et al. 2003) was used to analyze the data produced by the chips. Two RMA analyses of the GeneChip data were performed. First, we performed a global gene analysis that looked at all genes on both chips simultaneously. Probe sets for which expression was significantly different between exposed and unexposed individuals were identified using a standard paired t-test, and a recently developed bootstrapping technique to provide a critical value adjusted to provide a 5% familywise error rate (FWER FWER Family Wise Error Rate ), the standard value used in the literature. The bootstrapping technique can provide a more accurate (and less conservative) FWER than standard methods (e.g., Bonferroni's adjustment) (Pollard and Van der Laan 2003). As Dudoit et al. (2004) noted, bootstrapping resampling techniques can be used to directly model the joint distribution of the null test statistics so that the dependence of genes is implicitly factored in when determining error rates for different cutoffs. The main advantage of this technique over others such as Bonferroni's is that it can provide accurate control of error rates even when gene expressions on the same chip are statistically dependent (in this case, Bonferroni is often very conservative). However, the theory developed for the technique is asymptotic, and its performance can be less than optimal with very small sample sizes (due to random sampling of matched array pairs with replacement causing an excessive number of ties in some samples). RMA (normalization, background correction, and calculation of expression) was applied to all genes and all chips simultaneously. We then performed a targeted analysis of cytokine genes by applying the multiple testing procedures to this subset after RMA processing was completed. This was based on the a priori hypothesis that cytokines Cytokines Chemicals made by the cells that act on other cells to stimulate or inhibit their function. Cytokines that stimulate growth are called "growth factors. involved in the immune response immune response n. An integrated bodily response to an antigen, especially one mediated by lymphocytes and involving recognition of antigens by specific antibodies or previously sensitized lymphocytes. should be affected by benzene exposure because of its known immunotoxicity, and we thus derived more power to select differentially expressed cytokine genes by limiting the analysis only to this subset. The subset of 508 cytokine probe sets represented on the U133 chips was identified using NetAffx (http://www.affymetrix.com/analysis/ index.affx) and the key word "cytokine." Real-time PCR confirmation using TaqMan. Total RNA (100 ng) was converted to cDNA using the SuperScript Any letter, digit or symbol that appears above the line. For example, 10 to the 9th power is written with the 9 in superscript (109). Contrast with subscript. First-Strand Synthesis System for reverse-transcriptase PCR (Invitrogen, Carlsbad, CA) using oligo dT primers according to the manufacturer's instructions. This cDNA was used to confirm GeneChip findings using TaqMan Gene Expression Assays (TMGEAs; Applied Biosystems Applied Biosystems, Inc. (formerly NASDAQ: ABIO) is the original name of a pioneer biotechnology company founded in 1981 in Foster City, California, among the Silicon Valley cities of the southern San Francisco Bay Area. , Foster City, CA). Assays were run in quadruplicate quad·ru·pli·cate adj. 1. Multiplied by four; quadruple. 2. Fourth in a group of four identical things. n. One of a group of four identical things. tr. & intr.v. with 1x TaqMan Master Mix (Applied Biosystems), 1x assay mix, and 50 ng of cDNA in each 25-[micro]L reaction for six genes plus TATA box TATA box a eukaryotic DNA sequence usually TATAAATA, similar to the Pribnow box of Escherichia coli, occurring in the promoter region 25 to 35 bases upstream from the transcriptional start site that binds the general transcription factor TFIID which begins the formation of binding protein (TBP TBP To Be Provided/Published TBP TATA-Box-Binding Protein TBP Tau Beta Pi (National Engineers Honors Society) TBP The Black Parade TBP Tributylphosphate TBP To Be Printed TBP To Be Produced TBP True Boiling Point ) as an endogenous control (32 reactions/sample). Reactions were run on Applied Biosystems ABI Abi (ā`bī) [short for Abijah], in the Bible, King Hezekiah's mother. (Application Binary Interface) A specification for a specific hardware platform combined with the operating system. PRISM 7700 Sequence Detection System as follows: 95[degrees]C for 10 min, followed by 40 cycles of 95[degrees]C for 15 sec, 60[degrees]C for 1 min. The 12 samples that had been run on chips were run in exposed-unexposed pairs to reduce experimental variability. The mean baseline (TBP) threshold cycle ([C.sub.t]) was subtracted from the mean [C.sub.t] for the other six assays to normalize normalize to convert a set of data by, for example, converting them to logarithms or reciprocals so that their previous non-normal distribution is converted to a normal one. results. These were then compared between exposed and unexposed sample pairs. Assays used were as follows: TBP (endogenous control), Hs99999910_m1; CXCL16, Hs00222859_m1; IL4R, Hs00166237_m1; JUN, Hs00277190_s1; PF4, Hs00427220_g1; PTPRE, Hs00369944_m1; ZNF331, Hs00367929_m1. Results Differential global gene expression in the exposed-control matched pairs. PBMC RNA from six matched pairs of subjects (one exposed and one age- and sex-matched control in each pair) was analyzed on Affymetrix GeneChips. RMA analysis of the data using paired t-statistics, bootstrapping, and a 5% FWER indicated that 2,129 probe sets were significantly different in people exposed to high levels of benzene compared with matched unexposed subjects. Expression of 964 of these probe sets was decreased, and 1,165 were increased. Table 1 shows the top 25 up-regulated probe sets identified on the basis of the lowest p-values, and Table 2 shows the top 25 down-regulated probe sets. A number of the probe sets identified in the top 50 were expressed sequence tags (ESTs) or coded only for hypothetical proteins. Twenty-nine probe sets corresponded to genes coding for known proteins. Of these, the gene for HSPA1A HSPA1A Heat-Shock 70-KD Protein 1A was the most strongly down-regulated (-66%) (Table 1), and that for CREM CREM Corporate Real Estate Management CREM Classified Removable Electronic Media CREM cAMP Response Element Modulator CREM Center for Rural Emergency Medicine (West Virginia University) was the most strongly up-regulated (+145%) (Table 2). The significance of this latter finding is unclear because CREM has very low expression in lymphocytes. Other genes of note that were up-regulated were ZNF331, PTPRE, toll-like receptor Toll-like receptors (TLRs) are a class of single membrane-spanning non-catalytic receptors that recognize structurally conserved molecules derived from microbes once they have breached physical barriers such as the skin or intestinal tract mucosa, and activate immune cell 2, the chemokine chemokine /che·mo·kine/ (ke´mo-kin) any of a group of low molecular weight cytokines identified on the basis of their ability to induce chemotaxis or chemokinesis in leukocytes (or in particular populations of leukocytes) in inflammation. CXCL16, and CD44 antigen (Table 1). Note that CD44 and ZNF331 are present on both A and B chips and so they are listed twice in Table 1, but both have similar p-values and expression ratios on the A and B chips, providing a good quality control check. Other genes of interest that were significantly down-regulated include the oncogene oncogene Gene that can cause cancer. It is a sequence of DNA that has been altered or mutated from its original form, the proto-oncogene (see mutation). Proto-oncogenes promote the specialization and division of normal cells. JUN, MAP4, and CALM1 (Table 2). Differential cytokine gene expression in the exposed-control matched pairs. RMA analysis of the subgroup of 508 cytokine probe sets on the chip indicated that the expression of 19 cytokine genes was significantly different between the exposed and control subjects (Table 3). IFNGR1, IL6R IL6R Interleukin 6 Receptor , CCNT CCNT Constellation Communication and Navigation Transceiver 2, PBEF PBEF Particle Beam Engineering Facility 1, and PPP (Point-to-Point Protocol) The most popular method for transporting IP packets over a serial link between the user and the ISP. Developed in 1994 by the IETF and superseding the SLIP protocol, PPP establishes the session between the user's computer and the ISP using 1CB were identified by two probe sets, so 28 identification numbers (IDs) are listed in Table 3. The 19 differentially expressed cytokine genes were also identified in the global analysis, but only a few had p-values low enough to be listed in Tables 1 and 2. However, several had high ratios of differential expression between exposed and controls, with PBEF1, IFNGR1, and CXCL16 being increased around 100% (Table 3). Interestingly, six of the up-regulated genes were receptors for interleukins 2, 4, 6, 10, and 11 and interferon gamma interferon gamma IFN-γ A 21-25 kD glycoprotein lymphokine encoded on chromosome 12q and produced by activated T and NK cells; IFN-γ is antiviral, regulates class II MHC antigen expression, Fc receptors and immunoglobulin production and class switching, , the latter being the most strongly down-regulated cytokine gene. PF4 was the second most significantly down-regulated gene (Table 3). Confirmation by real-time PCR. Four genes--CXCL16, JUN, PTPRE, and ZNF331--were chosen from the global analysis and two genes--L4R, PF4--from the cytokine subset for further study and confirmation by real-time PCR. We selected the global analysis genes for further study by first removing ESTs, hypothetical proteins, and genes with low levels of expression. From the remaining genes, we used magnitude and direction of change and availability of TMGEAs at the time of this analysis to decide which to confirm by real-time PCR. Using these three parameters, we chose three of the most significantly up-regulated genes and one strongly down-regulated gene (JUN) for confirmation. IL4R and PF4 were chosen for confirmation from the cytokine subset because they were, respectively, the most significantly up-regulated and down-regulated cytokine genes for which TMGEAs were available at the time of this analysis. Real-time PCR of RNA from the six exposed-control pairs tested by GeneChips confirmed that CXCL16 and ZNF331 were consistently up-regulated in exposed individuals (mean increases of 103% and 113%, respectively) and that JUN and PF4 were consistently down-regulated in exposed individuals (mean decreases of 81% and 58%, respectively) when compared with unexposed individuals (Figure 1). These differences in expression are very similar to those found by GeneChip analysis (Table 1). Results for IL4R and PTPRE were less concordant, with increases in some pairs and decreases in others (Figure 1). [FIGURE 1 OMITTED] Effect of benzene exposure on the expression of the differentially expressed genes. Having shown 100% concordance concordance /con·cor·dance/ (-kord´ins) in genetics, the occurrence of a given trait in both members of a twin pair.concor´dant con·cor·dance n. for CXCL16, ZNF331, JUN, and PF4 between array and real-time data Real-time data denotes information that is delivered immediately after collection. There is no delay in the timeliness of the information provided. Some uses of this term confuse it with the term dynamic data. in six matched pairs of benzene-exposed workers and controls, we studied their expression using real-time PCR in a larger set of exposed workers and matched controls (Table 4). RNA from the PBMCs of 13 highly exposed subjects (mean benzene = 43.7 [+ or -] 23.9 ppm) and 15 controls was examined (the original six matched pairs of subjects included). The exposed and unexposed subjects were matched on the basis of gender (p = 0.7), age (p = 0.48), current smoking status, and recent infections (Table 4). We also tested the effect of each covariate, and none negatively confounded (i.e., weakened) the impact of benzene exposure on any of the end points. In this larger data set, CXCL16 and ZNF331 were again shown to be very significantly up-regulated, and JUN and PF4 significantly down-regulated (Table 4). Thus, CXCL16, ZNF331, JUN, and PF4 are four genes clearly identified as being differentially expressed after benzene exposure. Lack of potential confounding confounding when the effects of two, or more, processes on results cannot be separated, the results are said to be confounded, a cause of bias in disease studies. confounding factor by changes in lymphocyte subsets. It is well established that benzene lowers peripheral blood lymphocyte counts (Qu et al. 2002; Rothman et al. 1996), and certain lymphocyte subsets may be more sensitive to benzene's effects than are others. This raises a concern that our findings could be explained in part by a different distribution of lymphocyte subset populations in workers exposed to benzene compared with controls. To address this potential confounding, we first evaluated the distribution of all measured cell populations that comprise the PBMCs from which mRNA was isolated (Table 5). Total mononuclear cells (i.e., monocytes, CD[4.sup.+] T, CD[8.sup.+] T, CD[19.sup.+] B), lymphocytes, and CD56 (NK) cells were significantly decreased in exposed workers compared with controls (p = 0.0052; Table 5). Further, the percentage of total mononuclear cells composed of B cells (i.e., B-cell mononuclear percentage) in the exposed workers was significantly less than that in controls (p = 0.0061), whereas the CD[8.sup.+] T-cell mononuclear percentage was significantly increased (p = 0.0096). Using linear regression Linear regression A statistical technique for fitting a straight line to a set of data points. , we determined that the proportion of the mononuclear cell fraction made up by each of the five cell types had no impact on expression of CXCL16, ZNF331, JUN, and PF4. Further, the strength and direction of the association between benzene exposure and gene expression were only minimally changed after adjusting for both CD[8.sup.+] T-cell and CD[19.sup.+] B-cell mononuclear cell number and percentages. Discussion To our knowledge, this is the first molecular epidemiologic study to use whole-genome Affymetrix GeneChips for in vivo in vivo /in vi·vo/ (ve´vo) [L.] within the living body. in vi·vo adj. Within a living organism. in vivo adv. studies of the effects of a specific chemical exposure in humans. A limited number of earlier studies have looked at selected subsets of genes (Wu et al. 2003) or at the effects of smoking (Lampe et al. 2004), but none has examined differences in expression in the transcriptome in the context of benzene exposure. Using a relatively small sample size of six matched pairs of exposed and control subjects, we have been able to identify differentially expressed genes in the PBMC of benzene-exposed individuals that could be confirmed and measured by real-time PCR. A global analysis of 45,000 probe sets, representing approximately 33,000 well-substantiated human genes, was performed on the GeneChips using stabilized PBMC RNA collected in the field in China as part of a large molecular epidemiology study of benzene-exposed workers (Vermeulen et al. 2004). Although the results will differ based on both the type of processing (e.g., RMA) and adjustment for multiple testing (e.g., FWER with bootstrapping), our results showed that, in the six pairs examined, a potentially large number (> 2,100) of probe sets were (statistically) differentially expressed in the benzene-exposed subjects compared with the control, unexposed subjects. Because the accuracy of this bootstrapping technique is based on asymptotic theory Asymptotic theory is the branch of mathematics which studies properties of asymptotic expansions. The most known result of this field is the prime number theorem: Let π(x) be the number of prime numbers that are smaller than or equal to x. , a 5% FWER is not guaranteed, and thus the statistical results should not be the only criterion for identifying genes for more detailed study. However, by ranking the differentially expressed probe sets identified in the global gene analysis by unadjusted p-value, we were able to identify the top 50 that were highly likely to be differentially expressed. We chose four of the known genes from this list for confirmation by real-time PCR. We also increased our probability of finding genes altered by benzene exposure by performing an analysis of the limited subset of 508 cytokine probe sets on the GeneChips. The equivalent analysis of this subgroup indicated that the expression of 19 cytokine genes was significantly different between the exposed and control subjects, and two of these were chosen for confirmation by real-time PCR. Genes were chosen for real-time PCR confirmation based on the availability of TMGEA, fold change of expression, and expected copy number. Genes thought to be expressed at very low levels were avoided because these have much higher relative measurement error rates (Novak et al. 2002) and because availability of RNA for confirmation was limited. The six genes chosen for further study were CXCL16, JUN, PTPRE, ZNF331, IL4R, and PF4. TBP was chosen as the endogenous control gene because recent research suggests it is well suited for real-time PCR investigations of lymphocytes (Lossos et al. 2003) and it is on a different chromosome (6q) from the other genes investigated. This means that gross genetic events (chromosome duplications, deletions, etc.), which may alter the copy number of the genes investigated, will not affect the control gene. GeneChip findings in the six pairs of subjects for ZNF331, CXCL16, PF4, and JUN were all shown to be concordant with real-time PCR data. For IL4R and PTPRE there was less consistency between the GeneChip findings and those by real-time PCR in the six pairs (Figure 1). Low copy number can be ruled out as an explanation for the discrepancies between GeneChip and real-time PCR findings for IL4R and PTPRE because they were detected at levels similar to those of the other four genes. One possible explanation is that the Affymetrix probes for these genes are at the 3' end of transcripts whereas the probes for the IL4R and PTPRETMGEAs span exons farther upstream. The different target sequences might explain the discrepancies found in relative expression. Results of real-time PCR using the second-round cRNA used for hybridization to the GeneChips were not significantly different from those described in Figure 1 (data not shown). This suggests that differences were not caused by use of the Small Sample Target Labeling Assay Version II protocol. In a larger set of 28 study subjects, all four concordant genes were shown to be consistently altered by benzene exposure: CXCL16 and ZNF331 were up-regulated, whereas JUN and PF4 were down-regulated. Alteration in the expression of any of the four genes could be a consequence of upstream events. However, it is also possible that germline variation in one or more regulatory regions In biochemistry, a regulatory region is a DNA base sequence that controls gene expression. Overview A gene is a stretch of DNA that codes for the creation of a particular protein, such as an enzyme. of these four genes could be particularly susceptible to the effects of benzene exposure. Further studies are needed to investigate genetic variation across each of these genes to determine if one or more variants could be functionally important in benzene exposure. The identification of interactions between genetic variants and benzene's effects could lead to further insights into the mechanisms associated with benzene-induced leukemia and other hematologic hematological, hematologic pertaining to or emanating from blood cells. hematological tests total and differential white cell counts, hematocrit estimation, erythrocyte count. diseases. CXCL16 is also known as SR-PSOX or CXCLG16 and maps to chromosome 17p13. It encodes chemokine (C-X-C motif) ligand 16, a scavenger receptor Scavenger receptors is a group of receptors that recognize modified low density lipoprotein (LDL) by oxidation or acetylation. This naming is based on a function of cleaning (scavenging): scavenger receptors widely recognize and uptake macromolecules having a negative charge as that mediates adhesion and phagocytosis phagocytosis: see endocytosis. Phagocytosis A mechanism by which single cells of the animal kingdom, such as smaller protozoa, engulf and carry particles into the cytoplasm. of both Gram-negative and Gram-positive bacteria. This facilitates the uptake of various pathogens and chemotaxis chemotaxis: see taxis. of T cell and NK T cells T cells A type of white blood cell produced in the thymus gland. T cells are an important part of the immune system. Infants born with an underdeveloped or absent thymus do not have a normal level of T cells in their blood. by antigen-presenting cells through its chemokine domain (Shimaoka et al. 2003). ZNF331 is a member of the Kruppel-related family of zinc finger proteins that contain a Kruppel associated box (KRAB) domain and is likely a transcriptional repressor repressor: see nucleic acid. (Meiboom et al. 2003). The ZNF331 gene lies close to a frequent breakpoint The location in a program used to temporarily halt the program for testing and debugging. Lines of code in a source program are marked for breakpoints. When those instructions are about to be executed, the program stops, allowing the programmer to examine the status of the program region of follicular fol·lic·u·lar adj. 1. Relating to, having, or resembling a follicle or follicles. 2. Affecting or growing out of a follicle or follicles. thyroid adenomas (Meiboom et al. 2003), but the question of why benzene should so markedly affect ZNF331 expression remains unclear at present. The JUN, FOS FOS abbr. free on steamer , MAF MAF macrophage activating factor. , and A TF subfamilies are dimeric, basic region-leucine zipper zipper Device for binding the edges of an opening, as on a garment or a bag. A zipper consists of two strips of material with metal or plastic teeth along the edges, and a sliding piece that interlocks the teeth when moved in one direction and separates them again when moved proteins that make up the AP-1 transcription factor Please [improve the article] or discuss this issue on the talk page. . AP-1 transcription factors (Shaulian and Karin 2002) are involved in both the induction and prevention of apoptosis, depending on tissue type (Shanlian and Karin 2002). As part of AP-1, JUN is primarily a positive regulator of proliferation. JUN-deficient fibroblasts Fibroblasts A type of cell found in connective tissue; produces collagen. Mentioned in: Skin Grafting have marked proliferative defects in vitro in vitro /in vi·tro/ (in ve´tro) [L.] within a glass; observable in a test tube; in an artificial environment. in vi·tro adj. In an artificial environment outside a living organism. (Schreiber et al. 1999; Wisdom et al. 1999), and proliferation of JUN-deficient hepatocytes is severely impaired during liver regeneration in vivo (Bakiri et al. 2000; Behrens et al. 2002; Schreiber et al. 1999). In mouse erythroleukemia erythroleukemia /eryth·ro·leu·ke·mia/ (e-rith?ro-loo-ke´me-ah) a malignant blood dyscrasia, one of the myeloproliferative disorders, with atypical erythroblasts and myeloblasts in the peripheral blood. and fibroblast fibroblast /fi·bro·blast/ (fi´bro-blast) 1. an immature fiber-producing cell of connective tissue capable of differentiating into chondroblast, collagenoblast, or osteoblast. 2. cells, inhibition of fos and jun has demonstrated their requirement for proliferation and cell-cycle progression (Shanlian and Karin 2001). The lower levels of JUN could thus be indicative that the PBMCs of benzene-exposed individuals are not proliferating or progressing through the cell cycle as quickly as those of nonexposed individuals. PF4, also known as CXCL4, is a polypeptide polypeptide: see peptide. constituent of platelet alpha granules that is released during platelet aggregation Platelet aggregation The clumping together of blood cells, possibly forming a clot. Mentioned in: Herbalism, Traditional Chinese and inhibits heparin-mediated reactions. PF4 has been shown to have numerous other biologic properties, including inhibiting endothelial endothelial /en·do·the·li·al/ (-the´le-al) pertaining to or made up of endothelium. Endothelial A layer of cells that lines the inside of certain body cavities, for example, blood vessels. cell proliferation, migration, and angiogenesis angiogenesis /an·gio·gen·e·sis/ (-jen´e-sis) vasculogenesis; development of blood vessels either in the embryo or in the form of neovascularization or revascularization. an·gi·o·gen·e·sis n. (Gupta and Singh 1994; Malone et al. 1990; Niewiarowski et al. 1976) and inhibiting T-cell function by down-modulating cell proliferation and cytokine release (Fleischer et al. 2002). PF4 is expressed exclusively in platelets, megakaryocytes, and their precursors (Doi et al. 1987), and its down-regulation in benzene-exposed workers probably reflects decreased expression in platelets or progenitor cells because they are present in the PBMC fraction. We explored whether increased and decreased expression of these genes after benzene exposure was a reflection an alteration in the subset make up of the PBMC population. Although benzene exposure did cause changes in the subset makeup of the PBMC fraction (Table 5), these changed proportions had no impact on expression of CXCL16, ZNF331, JUN, and PF4, and the strength and direction of the association between benzene exposure and gene expression was minimally changed after adjusting for both CD[8.sup.+] T-cell and B-cell mononuclear cell counts and percentages. Thus, the altered expression was not likely to be caused by changes in the make up of the PBMC fraction. Unfortunately, the subjects studied here were selected to have a high level of benzene exposure to make this initial exploratory effort as efficient as possible by maximizing the contrast between the exposed workers and controls. Consequently, the exposure range was too narrow to be able to detect a dose-response relationship The Dose-response relationship describes the change in effect on an organism caused by differing levels of exposure (or doses) to a stressor (usually a chemical). This may apply to individuals (eg: a small amount has no observable effect, a large amount is fatal), or to populations among exposed workers, which was not a goal of this study. In the future, we plan to analyze substantially more samples selected from workers with a wide range of benzene exposure to allow us to construct a detailed model of the dose-response relationship. Generating relative expression using RMA combined with a bootstrapping method for controlling the FWER appears to be an effective way to identify genes associated with chemical exposure. The relative expression of a subset of six genes (all selected as statistically differentially expressed from GeneChips) were confirmed by real-time PCR in either all or most of the six exposed-unexposed pairs analyzed and in a larger data set from 28 subjects. There was also remarkable consistency between the real-time data and the differential expression ratios calculated by RMA for at least four of these genes. Larger data sets will be needed if we are to characterize a pattern of gene expression related to benzene exposure using machine-learning algorithms. However, we did attempt to explore gene ontology The Gene Ontology project, or GO, provides a controlled vocabulary to describe gene and gene product attributes in any organism. It can be broadly split into two parts. with the program EASE (Expression Analysis Systematic Explorer; Hosack et al. 2003) using the current data set and found that immune response genes immune response gene n. A gene in the major histocompatibility complex that controls a cell's immune response to specific antigens. gave the largest number of significant population hits, supporting our decision to analyze cytokine genes as a subset. In conclusion, we have shown that microarray analysis can be a good tool for discovering genes of potential mechanistic interest or biomarkers of exposure and early effect in molecular epidemiologic studies of populations exposed to potential carcinogens Carcinogens Substances in the environment that cause cancer, presumably by inducing mutations, with prolonged exposure. Mentioned in: Colon Cancer, Rectal Cancer . Further, only small numbers of paired study subjects are required to identify differentially expressed genes, making such studies cost-effective. The small-sample protocol used here also limits the amount of high-quality RNA required, meaning that archived samples, stored by partial isolation and stabilization of the RNA in the field, are amenable to analysis. These studies therefore provide a model for biomarker discovery Biomarker discovery is the process by which biomarkers are discovered. It is a medical term. Many commonly used blood tests in medicine are biomarkers. The way that these tests have been found can be seen as biomarker discovery. in chemically exposed human populations, although with lower exposed populations it may be necessary to study more subject pairs, with 15 pairs probably being ideal. Because the price of global gene expression arrays is decreasing, such studies are becoming more feasible.
Table 1. List of top 25 probe sets up-regulated by benzene exposure
identified on U133 chips. (a)
Probe set ID Gene symbol Location
207630_s_at CREM 10p12.1-p11.1
221840_at PTPRE 10g26
219228_at ZNF331 19q13.3-q13.4
204924_at TLR2 4q32
227613_at ZNF331
223454_at CXCL16 17p13
228962_at PDE4D
214696_at MGC14376 17p13.3
210732_s_at LGALS8 1q42-q43
212371_at PNAS-4
225390_s_at KLF13
227645_at P101-PIN 17p13.1
226652_at USP3
221641_s_at ACATE2 Xp22.13
202055_at KPNA1
226743_at FLJ34922 17q12
228393_s_at ZNF302
225120_AT PURB
218515_at C21orf66 21q21.3
202224_at CRK
200614_at CL TC 17q11-qter
212014_x_at CD44 11p13
223461_at TBC1D7 6p23
209835_x_at CD44 11p13
213315_x_at LOC91966 Xq28
Probe set ID Gene description
207630_s_at cAMP responsive element modulator
221840_at protein tyrosine phosphatase receptor type E (b)
219228_at zinc finger protein 331 (b)
204924_at toll-like receptor 2
227613_at zinc finger protein 331
223454_at chemokine (C-X-C motif) ligand 16 (b)
228962_at phosphodiesterase 4D, cAMP-specific
(phosphodiesterase E3 dunce homolog, Drosophila)
214696_at hypothetical protein MGC14376
210732_s_at lectin, galactoside-binding, soluble, 8 (galectin 8)
212371_at CGI-146 protein
225390_s_at Kruppel-like factor 13
227645_at phosphoinositide-3-kinase, regulatory subunit,
polypeptide p101
226652_at ubiquitin specific protease 3
221641_s_at likely ortholog of mouse acyl-coenzyme A
thioesterase 2, mitochondrial
202055_at karyopherin alpha 1 (importin alpha 5)
226743_at hypothetical protein FLJ34922
228393_s_at zinc finger protein 302
225120_AT purine-rich element binding protein B
218515_at chromosome 21 open reading frame 66
202224_at v-crk sarcoma virus CT10 oncogene homolog (avian)
200614_at clathrin, heavy polypeptide (Hc)
212014_x_at CD44 antigen (homing function and Indian blood group
system)
223461_at TBC1 domain family, member 7
209835_x_at CD44 antigen (homing function and Indian blood group
system)
213315_x_at hypothetical protein LOC91966
Probe set ID p-Value Ratio (GM)
207630_s_at 3.97 x [10.sup.-4] 2.45
221840_at 7.77 x [10.sup.-4] 2.07
219228_at 4.49 x [10.sup.-4] 2.02
204924_at 5.09 x [10.sup.-4] 2.01
227613_at 3.73 x [10.sup.-4] 1.97
223454_at 3.93 x [10.sup.-4] 1.96
228962_at 7.11 x [10.sup.-4] 1.85
214696_at 1.71 x [10.sup.-4] 1.78
210732_s_at 5.49 x [10.sup.-4] 1.76
212371_at 7.61 x [10.sup.-4] 1.7
225390_s_at 4.34 x [10.sup.-4] 1.69
227645_at 4.71 x [10.sup.-4] 1.66
226652_at 5.48 x [10.sup.-4] 1.64
221641_s_at 6.01 x [10.sup.-4] 1.63
202055_at 2.89 x [10.sup.-4] 1.61
226743_at 7.42 x [10.sup.-4] 1.6
228393_s_at 6.64 x [10.sup.-4] 1.58
225120_AT 3.90 x [10.sup.-4] 1.58
218515_at 5.80 x [10.sup.-4] 1.56
202224_at 6.45 x [10.sup.-5] 1.55
200614_at 6.33 x [10.sup.-4] 1.55
212014_x_at 3.08 x [10.sup.-4] 1.54
223461_at 6.75 x [10.sup.-4] 1.51
209835_x_at 2.69 x [10.sup.-4] 1.51
213315_x_at 7.54 x [10.sup.-4] 1.49
GM, geometric mean.
(a) Top 25 up-regulated probe sets were selected by RMA on the basis
of p-value and then ranked according to expression ratio. Gene
annotations are from NetAffx
(http://www.affymetrix.com/analysis/index.affx). (b) Genes chosen for
further analysis by real-time PCR.
Table 2. List of top 25 probe sets down-regulated by benzene exposure
identified on U133 chips. (a)
Probe set ID Gene symbol Location
200800_s_at HSPA1A 6p21.3
242904_x_at MGC8721 8p12
213281_at JUN 1p32-p31
229264_at FLJ39739
237510_at MYNN 3q26.31
229054_at FLJ39739
202732_at PKIG 20g12-q13.1
229872_s_at KIAA0493 8q24.13
230574_at
224495_at MGC10744 17p13.1
243_g_at MAP4 3p21
244741_s_at MGC9913 19q13.43
221419_s_at
219503_s_at FLJ11036 3p25.1
240406_at USP16 21q22.11
241749_at 9q31.1
228932_at
227667_at
239063_at
236509_at
200655_s_at CALM1 14q24-q31
221384_at UCP1 4q28-q31
203834_s_at TGOLN2 2p11.2
225122_at RNF31 14q11.2
229975_at
Probe set ID Gene description
200800_s_at heat shock 70 kDa protein 1A
242904_x_at hypothetical protein MGC8721
213281_at v-jun sarcoma virus 17 oncogene homolog (avian) (b)
229264_at FLJ39739 protein (M. musculus) S00030 neurofilament
triplet M protein, mouse
237510_at myoneurin
229054_at FLJ39739 protein
202732_at protein kinase (cAMP-dependent, catalytic) inhibitor
gamma
229872_s_at Homo sapiens cDNA FLJ39739 fis, clone SMINT2016440
230574_at Homo sapiens transcribed sequences
224495_at hypothetical protein MGC10744
243_g_at microtubule-associated protein 4
244741_s_at hypothetical protein MGC9913
221419_s_at
219503_s_at hypothetical protein FLJ11036
240406_at ubiquitin specific protease 16
241749_at similar to RIKEN cDNA 2310039E09
228932_at Homo sapiens transcribed sequence with moderate
similarity to protein sp:P39194 (H. sapiens)
ALU7_HUMAN Alu subfamily SQ sequence contamination
warning entry
227667_at Homo sapiens transcribed sequence with weak similarity
to protein pir:B36298 (H. sapiens) B36298 proline-
rich protein PRB3S (cys)--human (fragment)
239063_at Homo sapiens cDNA FLJ39803 fis, clone SPLEN2007794
236509_at Homo sapiens transcribed sequences
200655_s_at calmodulin 1 (phosphorylase kinase, delta)
221384_at uncoupling protein 1 (mitochondrial, proton carrier)
203834_s_at trans-Golgi network protein 2
225122_at ring finger protein 31
229975_at Homo sapiens transcribed sequence with weak similarity
to protein ref:NP_060312.1
Probe set ID p-Value Ratio (GM)
200800_s_at 4.38 x [10.sup.-4] 0.34
242904_x_at 4.42 x [10.sup.-4] 0.43
213281_at 6.56 x [10.sup.-4] 0.51
229264_at 3.06 x [10.sup.-4] 0.56
237510_at 2.17 x [10.sup.-4] 0.59
229054_at 6.64 x [10.sup.-4] 0.59
202732_at 3.24 x [10.sup.-4] 0.65
229872_s_at 1.48 x [10.sup.-4] 0.67
230574_at 1.13 x [10.sup.-4] 0.7
224495_at 8.17 x [10.sup.-4] 0.73
243_g_at 1.41 x [10.sup.-4] 0.75
244741_s_at 4.67 x [10.sup.-4] 0.76
221419_s_at 6.82 x [10.sup.-4] 0.77
219503_s_at 1.21 x [10.sup.-4] 0.77
240406_at 4.19 x [10.sup.-4] 0.8
241749_at 2.96 x [10.sup.-4] 0.81
228932_at 4.08 x [10.sup.-4] 0.82
227667_at
1.56 x [10.sup.-4] 0.82
239063_at 3.51 x [10.sup.-5] 0.83
236509_at 4.61 x [10.sup.-4] 0.83
200655_s_at 1.23 x [10.sup.-4] 0.83
221384_at 9.86E-06 0.84
203834_s_at 5.44 x [10.sup.-4] 0.85
225122_at 1.87 x [10.sup.-4] 0.86
229975_at
8.19 x [10.sup.-4] 0.86
GM, geometric mean.
(a) Top 25 down-regulated probe sets were selected by RMA on the basis
of p-value and then ranked in the table according to expression ratio.
Gene annotations are from NetAffx (http://www.affymetrix.com/analysis/
index.affx). (b) JUN was chosen for further analysis by real-time PCR.
Table 3. List of cytokine probe sets identified by U133 GeneChips from
the cytokine subset that were significantly different in
benzene-exposed and unexposed individuals. (a)
Probe set ID Gene symbol Gene description
243296_at PBEF pre-B-cell colony-enhancing factor
211676_s_at IFNGR1 interferon gamma receptor 1
223454_at CXCL16 chemokine (C-X-C motif) ligand 16 (b)
217738_at PBEF pre-B-cell colony-enhancing factor
201408_at PPP1CB protein phosphatase 1, catalytic subunit,
beta isoform
213743_at CCNT2 cyclin T2
226333_at IL6R interleukin 6 receptor
209827_s_at IL16 interleukin 16 (lymphocyte
chemoattractant factor)
203233_at IL4R interleukin 4 receptor
228222_at PPP1CB protein phosphatase 1, catalytic subunit,
beta isoform
205945_at IL6R interleukin 6 receptor
204912_at IL10RA interleukin 10 receptor, alpha
205291_at IL2RB interleukin 2 receptor, beta
224914_s_at CIP29 cytokineinduced protein 29 kDa
202727_s_at IFNGR1 interferon gamma receptor 1
204773_at IL11RA interleukin 11 receptor, alpha
204645_at CCNT2 cyclin T2
223961_s_at CISH cytokine inducible SH2-containing protein
206359)at SOCS3 suppressor of cytokine signaling 3
Down-regulated gene
expression
210354_at IFNG interferon, gamma
206390_x_at PF4 platelet factor 4 [chemokine (C-X-C
motif) ligand 4] (b)
209767_s_at GP1BB glycoprotein lb (platelet), beta
polypeptide
201896_s_at DDA3 differential display and activated by p53
242254_at Homo sapiens transcribed sequence with
moderate similarity to protein
ref:NP_071431.1 (H. sapiens) cytokine
receptor-like factor 2; cytokine
receptor CRL2 precursor
213258_at TFPI tissue factor pathway inhibitor
(lipoprotein-associated coagulation
inhibitor)
244848_at CDNA FLJ31075 fis, clone HSYRA2001484
235889_at Homo sapiens transcribed sequence with
moderate similarity to protein
ref:NP_060312.1 (H.sapiens)
hypothetical protein FLJ20489
243438_at PDE7B phosphodiesterase 7B
Probe set ID p-Value Ratio (GM)
243296_at 0.0275 2.17
211676_s_at 0.0037 2.16
223454_at 0.0004 1.96
217738_at 0.0206 1.93
201408_at 0.0017 1.72
213743_at 0.0080 1.68
226333_at 0.0042 1.57
209827_s_at 0.0025 1.53
203233_at 0.0009 1.49
228222_at 0.0206 1.43
205945_at 0.0269 1.40
204912_at 0.0044 1.39
205291_at 0.0113 1.39
224914_s_at 0.0115 1.38
202727_s_at 0.0146 1.32
204773_at 0.0111 1.23
204645_at 0.0149 1.22
223961_s_at 0.0244 1.16
206359)at 0.0229 1.08
Down-regulated gene expression
210354_at 0.0149 0.35
206390_x_at 0.0108 0.62
209767_s_at 0.0204 0.77
201896_s_at 0.0232 0.80
242254_at 0.0187 0.84
213258_at 0.0124 0.85
244848_at 0.0282 0.88
235889_at 0.0131 0.89
243438_at 0.0019 0.91
GM, geometric mean.
(a) Gene annotations are from NetAffx (http://www.affymetrix.com/
analysis/index.affx). (b) Gene chosen for further analysis by
real-time PCR.
Table 4. Gene expression measured by real-time PCR in the larger
set of 28 benzene-exposed workers and controls.
Control Exposed
Characteristics (n = 15) (n = 13)
Sex
Male 8 (53%) 6 (46%)
Female 7 (47%) 7 (54%)
Age (years)
Mean [+ or -] SD 32.5 [+ or -] 8.8 35.1 [+ or -] 6.1
Median 33 36
Current smoking
Yes 5 (33%) 4 (31%)
No 10 (67%) 9 (69%)
Recent infection
Yes 1 (7%) 1 (8%)
No 14 (93%) 12 (92%)
Benzene exposure
Air level (ppm) < 0.04 43.66 [+ or -] 23.87
Urine level 0.36 [+ or -] 0.51 778.70 [+ or -] 1433.02
([micro]g/L)
Up-regulated genes
CXCL16 2.37 [+ or -] 0.62 (c) 3.66 [+ or -] 0.54
2.55 (d) 3.9
ZNF331 1.50 [+ or -] 0.70 3.00 [+ or -] 0.91
1.61 2.92
Down-regulated
genes
JUN 6.29 [+ or -] 1.00 4.94 [+ or -] 1.26
6.57 4.66
PF4 6.05 [+ or -] 0.90 5.46 [+ or -] 1.35
6.24 5.18
p-Value (b)
Characteristics p-Value (a) (Unadjusted)
Sex
Male
Female
Age (years)
Mean [+ or -] SD
Median
Current smoking
Yes
No
Recent infection
Yes
No
Benzene exposure
Air level (ppm)
Urine level
([micro]g/L)
Up-regulated genes
CXCL16 0.00001 <0.0001
ZNF331 0.00001 <0.0001
Down-regulated
genes
JUN 0.0037 0.0038
PF4 0.052 0.012 (e)
(a) By Wilcoxon exact test. (b) Analyzed by linear regression.
Results are unadjusted because age, sex, current smoking, recent
infections, and alcohol use did not weaken the effect of benzene
exposure on any gene transcript. (c) Data presented as mean
[+ or -] SD. (d) Data presented as median. (e) 0ne subject with
an extreme outlier value was excluded from the regression analysis.
Table 5. Subsets of lymphocytes and monocytes in the PBMC fraction
of the 28 subjects comprising the larger study population.
Cell types Control (n = 15)
Mononucleocytes/L 2240 [+ or -] 540 (b)
Monocytes/mononucleocytes (%) 8.24 [+ or -] 2.43
CD[4.sup.+] T cells/ 34.08 [+ or -] 7.95
mononucleocytes (%)
CD[8.sup.+] T cells/ 25.32 [+ or -] 6.98
mononucleocytes (%)
CD19 cells/mononucleocytes (%) 10.14 [+ or -] 2.27
CD56 cells/mononucleocytes (%) 22.22 [+ or -] 10.42
Cell types Exposed (n = 13) p-Value (a)
Mononucleocytes/L 1704 [+ or -] 393 0.0052
Monocytes/mononucleocytes (%) 8.53 [+ or -] 2.06 0.56
CD[4.sup.+] T cells/ 33.01 [+ or -] 5.62 0.86
mononucleocytes (%)
CD[8.sup.+] T cells/ 33.15 [+ or -] 7.13 0.0096
mononucleocytes (%)
CD19 cells/mononucleocytes (%) 6.79 [+ or -] 5.00 0.0061
CD56 cells/mononucleocytes (%) 18.52 [+ or -] 4.20 0.62
(a) By Wilcoxon exact test. (b) Data presented as mean [+ or -] SD.
REFERENCES Affymetrix. 2003a. GeneChip Eukaryotic Small Sample Target Labeling Assay Version II. Santa Clara, CA:Affymetrix. Available: http://www.affymetrix.com/support/technical/ manual/expression_manual.affx [accessed 16 January 2003]. Affymetrix. 2003b. GeneChip Expression Analysis Technical Manual. Santa Clara, CA:Affymetrix. Available: http:// www.affymetrix.com/support/technical/manual/ expression_manual.affx [accessed 16 January 2003]. Aoyama K. 1986. Effects of benzene inhalation on lymphocyte subpopulations and immune response in mice. Toxicol Appl Pharmacol 85:92-101. Bakiri L, Lallemand D, Bossy-Wetzel E, Yaniv M. 2000. Cell cycle-dependent variations in c-Jun and JunB phosphorylation phosphorylation, chemical process in which a phosphate group is added to an organic molecule. In living cells phosphorylation is associated with respiration, which takes place in the cell's mitochondria, and photosynthesis, which takes place in the chloroplasts. : a role in the control of cyclin D Cyclin D is a member of the cyclin family. It regulates cyclin-dependent kinase types 4 and 6. External links
• • 1 expression. EMBO J 19:2056-2068. Behrens A, Sibilia M, David JP, Mohle-Steinlein U, Tronche F, Schutz G, et al. 2002. impaired postnatal postnatal /post·na·tal/ (-na´t'l) occurring after birth, with reference to the newborn. post·na·tal adj. Of or occurring after birth, especially in the period immediately after birth. hepatocyte hepatocyte /hep·a·to·cyte/ (hep´ah-to-sit?) a hepatic cell. hep·a·to·cyte n. A parenchymal liver cell. Hepatocyte A liver cell. proliferation and liver regeneration in mice lacking c-jun in the liver. EMBO J 21:1782-1790. Doi T, Greenberg SM, Rosenberg RD. 1987. Structure of the rat platelet factor 4 gene: a marker for megakaryocyte megakaryocyte /mega·karyo·cyte/ (-sit?) the giant cell of bone marrow containing a greatly lobulated nucleus, from which mature blood platelets originate.megakaryocyt´ic meg·a·kar·y·o·cyte n. differentiation. Mol Cell Biol 7:896-904. Dudoit S, van der Lean MJ, Pollard KS. 2004. Multiple testing. Part I. Single-step procedures for control of general type I error rates. Star Appl Genet genet: see civet. Mol Biol 3(1):Article 13. Available: http://www.bepress.com/sagmh/vol3/iss1/art13/ [accessed 2 September 2004]. Etienne W, Meyer MH, Peppers J, Meyer RA Jr. 2004. Comparison of mRNA gene expression by RT-PCR RT-PCR reverse transcriptase-polymerase chain reaction. See PCR1. and DNA microarray DNA microarray A small solid support, usually a membrane or glass slide, on which sequences of DNA are fixed in an orderly arrangement. DNA microarrays are used for rapid surveys of the expression of many genes simultaneously, as the sequences contained on a . Biotechniques 36:616-626. Fleischer J, Grage-Griebenow E, Kasper B, Heine H, Ernst M, Brandt E, et al. 2002. Pletelet factor 4 inhibits proliferation and cytokine release of activated human T cells. J Immonol 169:770-777. Gist GL, Burg JR. 1997. Benzene--a review of the literature from a health effects perspective. Toxicol Ind Health 13:661-714. Gupta SK, Singh JP. 1994. Inhibition of endothelial cell proliferation by platelet factor-4 involves a unique action on S phase progression. J Cell Biol 127:1121-1127. Hamadeh HK, Amin RP, Paules RS, Afshari CA. 2002. An overview of toxicogenomics. Curr Issues Mol Biol 4:45-56. Hosack DA, Dennis G Jr, Sherman BT, Lane HC, Lempicki RA. 2003. Identifying biological themes within lists of genes with EASE. Genome Biol 4(10):R70-R70.8. Iacobuzio-Donahue CA, Ashfaq R, Maitre A, Adsey NV, Shen-Ong GL, Berg K, et al. 2003. Highly expressed genes in pancreatic ductal adenocarcinomas: a comprehensive characterization and comparison of the transcription profiles obtained from three major technologies. Cancer Res 63:8614-8622. Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, Speed TP. 2003. Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Nucleic acids The cellular molecules DNA and RNA that act as coded instructions for the production of proteins and are copied for transmission of inherited traits. Res 31:e15. Krewski D, Snyder B, Beatty P, Granville G, Meek B, Sonawane B. 2000. Assessing the health risks of benzene: a report on the benzene state-of-the-science workshop. J Toxicol Environ Health A 61:307-338. Lampe JW, Stepeniants SB, Meg M, Radich JP, Dai H, Linsley PS, et al. 2004. Signatures of environmental exposures using peripheral leukocyte leukocyte (l `kəsīt'): see blood. leukocyte or white blood cell or white corpuscle gene expression: tobacco smoke. Cancer Epidemiol Biomarkers Prev 13:445-453. Lossos IS, Czerwinski DK, Wechser MA, Levy R. 2003. Optimization of quantitative real-time RT-PCB parameters for the study of lymphoid lymphoid /lym·phoid/ (lim´foid) resembling or pertaining to lymph or tissue of the lymphoid system. lym·phoid adj. Of or relating to lymph or the lymphatic tissue where lymphocytes are formed. malignancies. Leukemia 17:780-795. Maione TE, Gray GS, Petro J, Hunt AJ, Donner AL, Bauer SI, et al. 1990. Inhibition of angiogenesis by recombinant human platelet factor-4 and related peptides. Science 247:77-79. Meiboom M, Murua Escobar H, Pentimalli F, Fusco A, Beige G, Bullerdiek J. 2003. A 3.4-kbp transcript of ZNF331 is solely expressed in follicular thyroid adenomas. Cytogenet Genome Res 101(2):113-117. Niewiarowski S, Lowery low·er·y also lour·y adj. Overcast; threatening. CT, Hawiger J, Millman M, Timmons S. 1976. Immunoassay Immunoassay An assay that quantifies antigen or antibody by immunochemical means. The antigen can be a relatively simple substance such as a drug, or a complex one such as a protein or a virus. of human platelet factor 4(PF4, anti-heparin factor) by radial immunodiffusion. J Lab Clin Med 87:720-733. Novak JP, Sladek B, Hudson TJ. 2002. Characterization of variability in large-scale gene expression data: implications for study design. Genomics 79:104-113. Pollard KS, Van der Lean MJ. 2003. Resampling-based Multiple Testing: Asymptotic Control of Type I Error and Applications to Gene Expression Data. Division of Biostatistics Working Paper 121. Berkeley, CA:University of California The University of California has a combined student body of more than 191,000 students, over 1,340,000 living alumni, and a combined systemwide and campus endowment of just over $7.3 billion (8th largest in the United States). Berkeley. Available: http://www.bepress.com/ocbiostat/peper121 [accessed 2 September 2004]. Qu Q., Shore R, Li G, Jin X, Chert chert: see flint. LC, Cohen cohen or kohen (Hebrew: “priest”) Jewish priest descended from Zadok (a descendant of Aaron), priest at the First Temple of Jerusalem. The biblical priesthood was hereditary and male. B, et al. 2002. Hematological hematological, hematologic pertaining to or emanating from blood cells. hematological tests total and differential white cell counts, hematocrit estimation, erythrocyte count. changes among Chinese workers with a broad range of benzene exposures. Am J Ind Med 42:275-285. Rothman N, Li GL, Dosemeci M, Bechtold WE, Merti GE, Wang YZ, et al. 1996. Hematotoxocity among Chinese workers heavily exposed to benzene. Am J Ind Med 29:230-246. Schreiber M, Kolbus A, Piu F, Szabowski A, Mohle-Steinlein U, Tian Tian or T'ien (Chinese; “Heaven”) In indigenous Chinese religion, the supreme power reigning over humans and lesser gods. The term refers to a deity, to impersonal nature, or to both. J, et al. 1999. Control of cell cycle progression by c-Jun is p53 dependent. Genes Dev 13:607-619. Shaulian E, Kerin M. 2001. AP-1 in cell proliferation and survival. Oncogene 20:2390-2400. Shaulian E, Karin M. 2002. AP-1 as a regulator of cell life and death. Nat Cell Biol 4:E131-E136. Shimaoka T, Nakayama T, Kume N, Takahashi S, Yamaguchi J, Minami M, et al. 2003. Cutting edge: SR-PSOX/CXC chemokine ligand 16 mediates bacterial phagocytosis by APCs through its chemokine domain. J Immunol 171: 1647-1651. Snyder R. 2002. Benzene and leukemia. Crit Rev Toxicol 32:155-210. Staudt LM. 2003. Molecular diagnosis of the hematologic cancers. N Engl J Med 348:1777-1785. Staudt LM, Brown PO. 2000. Genomic views of the immune system. Annu Rev Immunol 18:829-859. Thach DC, Lin B, Walter E, Kruzelock R, Rowley RK, Tibbetts C, et al. 2003. Assessment of two methods for handling blood in collection tubes with RNA stabilizing agent for surveillance of gene expression profiles with high density microarrays. J Immunol Methods 263:266-279. Vermeulen R, Li G, Lan Q, Dosemeci M, Rappaport SM, Bohong X, et al. 2004. Detailed exposure assessment for e molecular epidemiology study of benzene in two shoe factories in China. Ann Occup Hyg 48:105-116. Waters MD, Olden old·en adj. Of, relating to, or belonging to time long past; old or ancient: olden days. [Middle English : old, old; see old + -en, adj. K, Tennant RW. 2003. Toxicogenomic approach for assessing toxicant-related disease. Mutat Res 544:415-424. Whitney AR, Diehn M, Popper An early Unix POP server, which was written at the University of California at Berkeley. SJ, Alizadeh AA, Boldrick JC, Relman DA, et al. 2003. Individuality and variation in gene expression patterns in human blood. Proc Natl Acad Sci USA 100:1896-1901. Wisdom R, Johnson RS, Moore C. 1999. c-Jun regulates cell cycle progression and apoptosis by distinct mechanisms. EMBO J 18:188-197. Wu MM, Chiou HY, Ho IC, Chen CJ, Lee TC. 2003. Gene expression of inflammatory molecules in circulating lymphocytes from arsenic-exposed human subjects. Environ Health Perspect 111:1426-1438. Matthew S. Forrest, (1) Qing Lan, (2) Alan E. Hubbard, (1) Luoping Zhang, (1) Roel Vermeulen, (2) Xin Zhao, (1) Guilan Li, (3) Yen-Ying Wu, (1) Min Shen Shen, in the Bible, place, perhaps close to Bethel, near which Samuel set up the stone Ebenezer. , (2) Songnian Yin, (3) Stephen J. Chanock, (2) Nathaniel Rothman, (2) and Martyn T. Smith (1) (1) School of Public Health, University of California, Berkeley The University of California, Berkeley is a public research university located in Berkeley, California, United States. Commonly referred to as UC Berkeley, Berkeley and Cal , California, USA; (2) Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland, USA; (3) National Institute of Occupational Health and Poison Control, Chinese Center for Disease Control and Prevention Noun 1. Center for Disease Control and Prevention - a federal agency in the Department of Health and Human Services; located in Atlanta; investigates and diagnoses and tries to control or prevent diseases (especially new and unusual diseases) CDC , Beijing, China Address correspondence to M.T. Smith, School of Public Health, 140 Warren Hall, University of California, Berkeley, CA 94720-7360 USA. Telephone: (510) 642-8770. Fax: (510) 642-0427. E-mail: martynts@berkeley.edu We thank H. Asahara for assistance with the Affymetrix GeneChip hybridizations. This work was supported by National Institutes of Health grants RO1ES006721 and P30ES001896 to M.T.S. The authors declare competing financial interests. M.T.S. has received consulting and expert testimony Testimony about a scientific, technical, or professional issue given by a person qualified to testify because of familiarity with the subject or special training in the field. fees from law firms representing both plaintiffs and defendants in cases involving exposure to benzene. G.L. has received funds from the American Petroleum Institute The American Petroleum Institute, commonly referred to as API, is the main U.S. trade association for the oil and natural gas industry, representing about 400 corporations involved in production, refinement, distribution, and many other aspects of the industry. for consulting on benzene-related health research. Received 5 October 2004; accepted 14 March 2005. |
|
||||||||||||||||

ēn')
Printer friendly
Cite/link
Email
Feedback
Reader Opinion