Organization and evolution of the Cyp2 gene cluster on mouse chromosome 7, and comparison with the syntenic human cluster *.Genes from the cytochrome cytochrome (sī`təkrōm'), protein containing heme (see coenzyme) that participates in the phase of biochemical respiration called oxidative phosphorylation. P450 (CY-P) superfamily superfamily /su·per·fam·i·ly/ (soo´per-fam?i-le) 1. a taxonomic category between an order and a family. 2. encode a diverse group of monooxygenases that play important roles in both endogenous endogenous /en·dog·e·nous/ (en-doj´e-nus) produced within or caused by factors within the organism. en·dog·e·nous adj. 1. Originating or produced within an organism, tissue, or cell. processes and in the metabolism of exogenous Exogenous Describes facts outside the control of the firm. Converse of endogenous. compounds, including most drugs. A cluster of Cyp2 genes on mouse chromosome 7 was mapped and analyzed in detail and compared with the homologous homologous /ho·mol·o·gous/ (ho-mol´ah-gus) 1. corresponding in structure, position, origin, etc. 2. allogeneic. ho·mol·o·gous adj. 1. cluster on human chromosome 19. The mouse cluster includes 22 loci loci [L.] plural of locus. loci Plural of locus, see there from the same six CYP2 subfamilies--Cyp2a, Cyp2b, Cyp2f, Cyp2g, Cyp2s, Cyp2t--that are found in the human cluster. Twelve of these loci are functional genes, and 10 are pseudogenes. Parts of the mouse and human gene clusters are similarly arranged, but the data indicate that a significantly different series of duplication events created the modern gene organizations in the two species. The comparison of the mouse and human clusters provides new insights into the evolution of gene families, whereas the detailed analysis provides background information that should be informative for future studies on the expression, regulation, and function of specific mouse Cyp2 genes. Key words: Cyp2a, Cyp2b, Cyp2f, Cyp2g, Cyp2s, Cyp2t, cytochrome P450, gene family, gene duplication Gene duplication is any duplication of a region of DNA that contains a gene; it may occur as an error in homologous recombination, a retrotransposition event, or duplication of an entire chromosome. [1]. . Environ Health Perspect 111:1835-1842 (2003). doi: 10.1289/txg.6546 available via http://dx.doi.org/[Online 24 September 2003] ********** Genes from the ancient cytochrome P450 (CYP) superfamily, which encode a large and diverse group of heme-thiolate monooxygenases, are present in the genomes of almost all species examined to date. Mammalian CYP enzymes have been particularly well studied because they detoxify de·tox·i·fy v. 1. To counteract or destroy the toxic properties of a substance. 2. To remove the effects of poison from something, such as the blood. 3. or activate a wide range of environmental and ingested in·gest tr.v. in·gest·ed, in·gest·ing, in·gests 1. To take into the body by the mouth for digestion or absorption. See Synonyms at eat. 2. compounds, including many drugs (Nelson et al. 1996), and they play important roles in many endogenous processes such as the metabolism of fatty acids fatty acid, any of the organic carboxylic acids present in fats and oils as esters of glycerol. Molecular weights of fatty acids vary over a wide range. The carbon skeleton of any fatty acid is unbranched. Some fatty acids are saturated, i.e. , steroids, and eicosanoids (Nebert and Russell 2002). The CYP superfamily is also an excellent group for studying the evolutionary mechanisms that create gene families; previous studies of" this group have provided clear examples of the molecular processes, such as tandem duplication and gene conversion, that are considered to be most important in gene family evolution (Fernandez-Salguero et al. 1995; Gonzalez and Nebert 1990). The CYP superfamily of genes has been divided hierarchically into families and subfamilies on the basis of sequence similarity, and there is a standardized nomenclature nomenclature /no·men·cla·ture/ (no´men-kla?cher) a classified system of names, as of anatomical structures, organisms, etc. binomial nomenclature incorporating this hierarchy (Nelson et al. 1993, 1996). Families are designated by adding a number to the root "CYP" ("Cyp" in mouse, e.g., Cyp2), and subfamilies are indicated by a letter (e.g., Cyp2a). Genes within a subfamily subfamily /sub·fam·i·ly/ (sub´fam-i-le) a taxonomic division between a family and a tribe. sub·fam·i·ly n. A taxonomic category ranking between a family and a genus. are numbered in order of discovery, regardless of species, and pseudogenes (both partial and full-length) are named by' adding "ps" to the related mouse gene ("P" for other species) or by adding independent numbers if no true genes are highly related. In general this system appears to reflect evolutionary relationships among the loci (Lewis et al. 1998), with different CYP gene families found in different major taxonomic groups. In vertebrates, larger families are divided into subfamilies that are typically scattered across genomes, but multiple loci from within a subfamily are usually physically clustered together on a single chromosome (Nelson et al. 1996). This pattern has been interpreted as reflecting the creation of most new CYP loci by tandem duplication (Nelson et al. 1993) so that recently duplicated and therefore highly similar loci remain in tandem Adv. 1. in tandem - one behind the other; "ride tandem on a bicycle built for two"; "riding horses down the path in tandem" tandem clusters, whereas older duplications have been broken up over evolutionary time by chromosomal rearrangements. In this era of genome projects genome project 1 The Human Genome Project, see there 2. A general term for a coordinated research initiative for mapping and sequencing the genome of any organism , it would seem relatively simple to analyze the organization and evolution of CYP gene clusters, based on available assembled sequences. However, even though both the human and mouse genome projects have now produced huge stockpiles of sequence information (Waterson et al. 2002), additional study is typically required for highly duplicated portions of mammalian genomes such as gene family clusters family cluster Epidemiology A grouping of disorders found in ≥ 2 members of a family (Eichler 1998). Computer-based assemblies cannot by themselves accommodate the complexities presented by clusters of closely related genes, and both polymerase chain reaction polymerase chain reaction (pŏl`ĭmərās') (PCR), laboratory process in which a particular DNA segment from a mixture of DNA chains is rapidly replicated, producing a large, readily analyzed sample of a piece of DNA; the process is (PCR PCR polymerase chain reaction. PCR abbr. polymerase chain reaction Polymerase chain reaction (PCR) )-based and hybridization-based analyses are confounded by the high levels of sequence similarity between paralogous loci. Mouse and human are estimated to have 190 and 115 CYP loci (including pseudogenes), respectively, some of which have very similar sequences (Hoffman and Keeney 2002). Thus, the study of these gene clusters has been best served by combining genomic sequencing genomic sequencing The sequencing of the entire genome of an organism. A Closer Look The technique that allows researchers to read and decipher the genetic information found in the DNA of anything from bacteria to plants to animals is with fine-scale physical mapping based on analyses of cloned DNA DNA: see nucleic acid. DNA or deoxyribonucleic acid One of two types of nucleic acid (the other is RNA); a complex organic compound found in all living cells and many viruses. It is the chemical substance of genes. (Hoffman et al. 2001). Detailed physical mapping using genomic clones proved to be an extremely fruitful approach for understanding one human gene cluster, the CYP2 cluster on chromosome 19 (Hoffman et al. 2001), which includes loci from the CYP2A, 2B, 2F, 2G, 2S, and 2T subfamilies (hereafter the CYP2A-T A-T Ataxia Telangiectasia (form of muscular weakness) cluster). Extending this map-based approach to the mouse, the most important model organism A model organism is a species that is extensively studied to understand particular biological phenomena, with the expectation that discoveries made in the organism model will provide insight into the workings of other organisms. for genetic studies in mammals, will allow researchers to better study the expression and variation of these genes. Until all individual genes and their related pseudogenes from a cluster have been analyzed in a species, it is nearly impossible to develop PCR primers that are sufficiently locus-specific to use for accurate genotyping Genotyping refers to the process of determining the genotype of an individual with a biological assay. Current methods of doing this include PCR, DNA sequencing, and hybridization to DNA microarrays or beads. , cloning into expression vectors expression vector n. A vector, such as a plasmid, yeast, or animal virus genome, used to introduce foreign genetic material into a host cell in order to replicate and amplify the foreign DNA sequences as a recombinant molecule. , or development of knockout mice. This study is intended to provide a practical basis for future studies of CYP gene expression, as well as to make a contribution to our understanding of the mechanisms underlying gene family evolution. Materials and Methods Families of closely related genes are difficult to analyze using standard techniques--because of high sequence similarity, a closely related gene or pseudogene pseu·do·gene n. A segment of DNA resembling a gene but lacking a genetic function. pseudogene a nonfunctional DNA sequence, nearly homologous to a functional gene. can easily be mistaken for the target gene during Southern blotting or PCR amplifications. The general procedure discussed below was followed for all loci, but specific techniques were integrated in different combinations as necessary to localize lo·cal·ize v. lo·cal·ized, lo·cal·iz·ing, lo·cal·iz·es v.tr. 1. To make local: decentralize and localize political authority. 2. and analyze each putative locus. Mouse bacterial artificial chromosome A bacterial artificial chromosome (BAC) is a DNA construct, based on a fertility plasmid (or F-plasmid), used for transforming and cloning in bacteria, usually E. coli. (BAC BAC abbr. blood alcohol concentration ) clones overlapping the Cyp2a-t cluster region were identified by library screening or database analysis, as described below. Specific exons and introns of CYP2 genes (see Supplemental Material) were then PCR-amplified from clone DNAs and sequenced. Some of the BAC clone DNAs were digested by multiple restriction enzymes restriction enzyme Protein (more specifically, an endonuclease) produced by bacteria that cleaves DNA at specific sites along its length. Thousands have been found, from many different bacteria; each recognizes a specific nucleotide sequence. , blotted onto nylon membranes, and hybridized with PCR-amplified products. Clones were assembled on the basis of shared gene sequences and on restriction mapping restriction mapping the ordering, left to right, of the set of DNA fragments of a DNA molecule produced by a particular restriction enzyme. . Clone overlaps were confirmed by developing sequence tag sites from the ends of each clone and testing them against other cloned DNAs and against known sequence fragments from both the public and Celera (Celera Genomics, Rockville, M D) mouse genome projects. Library Screening and Isolation of Bacterial Artificial Chromosome DNA The RP22 mouse BAC library was screened (Invitrogen Corp., Carlsbad, CA) with PCR products amplified from mouse genomic DNA genomic DNA n. The full complement of DNA contained in the genome of a cell or organism. , using primers designed to match one exon Exon In split genes, a portion that is included in the ribonucleic acid (RNA) transcript of a gene and survives processing of the RNA in the cell nucleus to become part of a spliced messenger RNA (mRNA) or structural RNA in the cell cytoplasm. each from the Qyp2a5, Cyp2b9, and Cyp2f2 eDNA sequences. The BAC library clones RP22-78A19, -44B20, -362C 15, -127H7, -548M4, and -160014 gave strong positive signals and were selected for further study. Public mouse genomic sequences and private sequences created by Celera were later examined for the presence of Cyp2a-t genes by comparing them with known Cyp2 cDNA sequences (Table 1). Partially sequenced clones from the RP23 BAC library identified by database analysis as containing Cyp2a-t cluster genes were -430G14, -174D7, -113D13, and -353B5 [GenBank accession nos. AC087157, AC087137, AC087130, AC087155, respectively (gene accession numbers are from GenBank: http://www.ncbi.nlm.nih.gov/Entrez)]. Additional RP23 BAC clones--RP23-368014, -120B2, and -314C12--were identified using the National Center for Biotechnology Information The National Center for Biotechnology Information (NCBI) is part of the United States National Library of Medicine (NLM), a branch of the National Institutes of Health. The NCBI is located in Bethesda, Maryland and was founded in 1988. (NCBI) MapViewer tool (http://www.ncbi.nlm. nih.gov/mapview) as being unsequenced but from the region of interest. Of the 13 BAC clones selected by library screening or by database analysis, one (RP23-353B5) was used only for sequence analysis. The DNAs of the remaining 12 clones were obtained and isolated using Qiagen 100 columns (Qiagen Inc., Valencia, CA), according to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. the manufacturer's protocol. Clones RP23-368014, -120B2, and -314C12 were used only as PCR templates for specific primers to establish patterns of clone overlap. RP22-78A19, -44B20, -362C 15, - 127H7, -548M4, -160014, and RP23-430G14, -174D7, and -113D13 were used for all other experiments in this study, including PCR amplifications, sequencing, restriction mapping, and Southern blotting. Unless otherwise noted, these nine clones are the BAC clones referred to throughout the rest of this article. PCR Amplification Polymerase chain reaction was performed on all samples using primers designed from mouse Cyp2 gene cDNA and genomic sequences in GenBank and from sequences available from Celera Genomics. All primer sequences and annealing annealing (ənēl`ĭng), process in which glass, metals, and other materials are treated to render them less brittle and more workable. temperatures can be found in Table 1 of the Supplemental Material or on the Cytochrome P450 Homepage (Nelson 2003). PCR amplification was carried out in a total volume of 25 [micro]L consisting of 1 x PCR buffer (10 mM Tris-HCl, pH 9.0; 50 mM KCI KCI Kansas City International (airport) KCI Kennel Club of India KCI Key Club International KCI Korea Concrete Institute KCI Kitchener Collegiate Institute KCI Kids Central, Inc. KCI The Kitchen Collection, Inc. KCI Kodak Canada Inc. ; 0.1% Triton X-100), 1 mM Mg[Cl.sub.2], 200 [micro]M each deoxynucleoside triphosphate triphosphate /tri·phos·phate/ (tri-fos´fat) a salt containing three phosphate radicals. tri·phos·phate n. A salt or ester containing three phosphate groups. , 0.5 [micro]M each primer, 0.75 units of Taq polymerase Taq polymerase ("Taq Pol," or simply "Taq") is a thermostable polymerase used in polymerase chain reaction to check for the presence or absence of a gene by amplifying a DNA fragment. It replaced E.coli DNA polymerase in PCR because of the temperature conditions of PCR. (Promega Corp., Madison, WI), and 100-500 ng template DNA. Amplification reactions were performed with various annealing temperatures and cycles. PCR products were electrophoresed on 1.5% low-melt agarose agarose more highly purified form of agar with similar uses to agar and widely used in the separation of nucleic acid fragments. gels, from which fragments were excised and purified using the QIAquick Gel Extraction In molecular biology, gel extraction or gel isolation is a technique used to isolate a desired fragment of intact DNA from an agarose gel following agarose gel electrophoresis. Kit (Qiagen). Subcloning and Sequencing Sequencing templates were isolated in either of two ways. For most of the Cyp2a-t loci, PCR amplifications were performed directly from BAC or genomic DNA preparations. In addition, some restriction fragments Restriction fragments are pieces of DNA produced from enzymatic cut. Most of such fragments are generated by the use of restriction enzymes such as EcoRI from E. coli. of BAC clones RP22-78A19, RP22-160014, RP22-548M4, RP23-430G14, and RP23-174D7 were subcloned into plasmid vector pBlueScript KS- (Stratagene Inc, La Jolla La Jolla (lə hoi`yə), on the Pacific Ocean, S Calif., an uninc. district within the confines of San Diego; founded 1869. The beautiful ocean beaches, in particular La Jolla shores and Black's Beach, and sea-washed caves attract visitors and , CA). Five to ten positive subclones were chosen by blue/white screening for each experiment, and plasmid DNAs were recovered using alkaline lysis Alkaline lysis is a method used in molecular biology to isolate plasmid DNA from bacterial cells. Bacteria containing the plasmid of interest is first grown, then lysed with a strong alkaline solution made from sodium dodecyl sulfate (SDS) and sodium hydroxide. minipreps, followed by direct sequencing using the T3 and T7 priming sites. All sequencing was done for both strands, using either BigDye or ET terminator (1) A character that ends a string of alphanumeric characters. (2) A hardware component that is connected to the last peripheral device in a series or the last node in a network. chemistry on an ABI Abi (ā`bī) [short for Abijah], in the Bible, King Hezekiah's mother. (Application Binary Interface) A specification for a specific hardware platform combined with the operating system. 310 automated DNA sequencer A DNA sequencer is an instrument used to automate the DNA sequencing process. DNA sequencers have become more important due to large genomics projects and the need to increase productivity. (Applied Biosystems Applied Biosystems, Inc. (formerly NASDAQ: ABIO) is the original name of a pioneer biotechnology company founded in 1981 in Foster City, California, among the Silicon Valley cities of the southern San Francisco Bay Area. , Foster City, CA). All samples were prepared for sequencing according to manufacturer instructions. The forward and reverse primers were the same primers used for PCR. DNA sequences DNA sequence Genetics The precise order of bases–A,T,G,C–in a segment of DNA, gene, chromosome, or an entire genome. See Base pair, Base sequence analysis, Chromosome, Gene, Genome. were analyzed using the NCBI BLAST similarity search tool (http://www.ncbi.nlm.nih.gov/BLAST/). Southern Blot Analysis South·ern blot analysis n. An electrophoretic procedure used to separate and identify DNA sequences. Each BAC clone DNA was separately digested with the restriction enzymes Cla I, Eco RI, Hind III, Pvu II, Sac I, and Xba I at 37[degrees]C for 12-36 hr. The digests were electrophoresed on 1% agarose gels, and the DNA fragments were blotted onto Hybond (Boehringer Mannheim, Indianapolis, IN) nylon membranes. PCR amplification products were made into fluorescent probes using the Genius DIG-labeling system (Boehringer Mannheim) and hybridized to the blots at 55[degrees]C overnight, followed by rinsing and labeling according to the manufacturer s protocol. Restriction Mapping Restriction maps restriction map for a DNA molecule, is constructed for a particular restriction endonuclease by ordering the fragments left to right in the DNA molecule. By convention, there are six fragments which are labeled in order of decreasing size, ABCDEF, but the order left to right in the of BAC clones RP22-78A 19, -44B20, -362C 15, -127H7, -548M4, -160014, and RP23-430G14, -174D7, and -113D13 were constructed using Eco RI and Hind III (data not shown). These restriction maps were compared with the draft genomic sequences and with the sizes of the fragments that hybridized with gene-specific and BAC end probes to confirm the overall assembly. Independent Bst 1107 I restriction maps of clones RP23-430G14, -174D7, -113D13, and -353B5 from the study of Kim et al. (2001) are available as supplemental data through the Lawrence Livermore National Laboratory Lawrence Livermore National Laboratory: see Lawrence Berkeley National Laboratory. (body) Lawrence Livermore National Laboratory - (LLNL) A research organaisatin operated by the University of California under a contract with the US Department of Energy. (Livermore, CA) website (http:l / greengenes.llnl.gov / mouse/html/syn _table.htm). These maps cover the region from Cyp2a4 through Cyp2b10 (Figure 1) that was most poorly resolved by Eco RI-Hind III mapping and genomic sequencing. Though the online assembled maps (Kim et al. 2001) have large unresolved regions and a number of mistakes, the actual digest data are accurate. We have extracted and reassembled the digest data to form a new map that is consistent with our Eco RI-Hind III restriction maps and with the draft genomic sequences. This composite restriction map (Table 2 of Supplemental Material) is the basis for the distances between the Cyp2a4 through Cyp2b10 loci shown in Figure 1. [FIGURE 1 OMITTED] Analysis of Draft Genomic Sequences Partial draft sequences of some BAC clones from the relevant region of mouse chromosome 7 are available on GenBank as of May 2003 (build 30). Some of the sequences from these clones are incorporated into the supercontig NT_039407, but this assembly contains many errors and should be disregarded. The older supercontigs NW_000303, NW_000307, and NW_000310 (build 29) are incomplete, but for the most part are correctly assembled. We created a more complete and accurate assembly of the draft sequences by rearranging the sequence contigs inside each clone to conform to Verb 1. conform to - satisfy a condition or restriction; "Does this paper meet the requirements for the degree?" fit, meet coordinate - be co-ordinated; "These activities coordinate well" our maps and by comparing small overlapping portions of contigs from different clones. Predicted restriction enzyme cut sites in the database sequences were compared with actual restriction mapping fragment sizes and with the data from Southern blotting to determine the order of sequence contigs, An independent sequence assembly for this region obtained from Celera Genomics (GA_x6K02T2PU9B, as of 1 June 2002) was also used to fill in some gaps. Sequence comparisons were done using the NCBI BLAST and Jellyfish jellyfish, common name for the free-swimming stage (see polyp and medusa), of certain invertebrate animals of the phylum Cnidaria (the coelenterates). The body of a jellyfish is shaped like a bell or umbrella, with a clear, jellylike material filling most of the (LabVelocity Inc., San Francisco San Francisco (săn frănsĭs`kō), city (1990 pop. 723,959), coextensive with San Francisco co., W Calif., on the tip of a peninsula between the Pacific Ocean and San Francisco Bay, which are connected by the strait known as the Golden , CA) software. Results The mouse Cyp2a-t gene cluster forms a small part of a 30-Mb region of chromosome 7 that is syntenic to most of the q arm of human chromosome 19 (Kim et al. 2(i)01). This entire region of synteny synteny /syn·te·ny/ (sin´te-ne) the presence together on the same chromosome of two or more gene loci whether or not in such proximity that they may be subject to linkage.synten´ic syn·te·ny n. in the two species is in the opposite orientation relative to the centromere centromere Structure in a chromosome that holds together the two chromatids. It is the point of attachment to the structure that pulls the chromatids to opposite ends of the cell during cell division (see mitosis). ; the mouse map in Figure 1 is shown with the telomere telomere /telo·mere/ (tel´o-mer) an extremity of a chromosome, which has specific properties, one of which is a polarity that prevents reunion with any fragment after a chromosome has been broken. to the left, the reverse of the conventional mode of display, so that it aligns with the established human map (Hoffman et al. 2001). The mouse Cyp2a-t cluster spans about 1.4 Mb, as measured by restriction mapping. It is delimited de·lim·it also de·lim·i·tate tr.v. de·lim·it·ed also de·lim·i·tat·ed, de·lim·it·ing also de·lim·i·tat·ing, de·lim·its also de·lim·i·tates To establish the limits or boundaries of; demarcate. by the Egln2 gene on the telomeric side and the Axl gene on the centromeric cen·tro·mere n. The most condensed and constricted region of a chromosome, to which the spindle fiber is attached during mitosis. cen side (Figure 1). These genes are orthologs of the EGLN2 and AXL genes that bracket the human cluster. A total of 22 CYP loci from six subfamilies were found in the mouse gene duster; 10 of the loci match previously sequenced mRNAs (Hoffman et al. 2001) and are therefore functional genes. Information on individual loci is compiled in Table 1, and the specific evidence used to localize and identify each locus is organized below by subfamily. The complete map of the region is shown in Figure 1. The relevant parts of the public and private sequence assemblies of mouse chromosome 7 (as of December 2002) are compared in Figure 1 with the composite map generated by this study. Both of these previous sequence assemblies are mostly accurate but incomplete across this region of the chromosome; a more recent assembly of the public data (build 30, February 2003) is markedly less accurate. Significant gaps occur in both assemblies. Gaps in the public contigs are quite accurately sized, but the sizes of several large gaps in the Celera assembly are seriously underestimated (Figure 1). Some apparent gaps in the public assembly can be filled, in fact, by integrating sequences from the draft versions of BAC clones RP23-430G14, -174D7, and -113D13 (GenBank accession nos. AC087157, AC087137, and AC087130) and from the small assembled contigs NW_000304, NW_000305, NW_000306, NW_000308, NW_000309, and NW011833. Two distinct regions of about 50 kb each, which contain the 2b26-ps and 2b27-ps pseudogenes, respectively (Figure 1), are incorrectly merged by both assemblies because of the very high level of sequence similarity between them. The presence of both of these regions on the chromosome was confirmed by restriction mapping and by specific PCR amplifications of fragments that bridge small gaps in the draft sequences. Additional details of experimental methods and results, including tables of the primers used, exact exon/intron boundaries, and restriction map data, are available in Tables 1-4 of the Supplemental Material and through the Cytochrome P450 Homepage (Nelson 2003). Descriptions of Loci by Subfamily Cyp2a. Three functional mouse Cyp2a genes, 2a4, 2a5, and 2a12, were previously identified from mRNAs (Iwasaki et al. 1993). Our analysis has discovered a new full-length 2a locus, located between the 2a5 and 2ai2 genes, that corresponds to a single mouse expressed sequence tag An expressed sequence tag or EST is a short sub-sequence of a transcribed spliced nucleotide sequence (either protein-coding or not). They may be used to identify gene transcripts, and are instrumental in gene discovery and gene sequence determination. (EST EST electroshock therapy. EST abbr. electroshock therapy ) in the database (GenBank accession no. BB667610) and is therefore likely to be functional. It has been given the name Cyp2a22. There are also three partial 2a pseudogenes--Cyp2a20-ps and Cyp2a23-ps, each of which consists only of exons 1 and 2, and Cyp2a21-ps, which has part of exon 3 and all of exons 4-9. To search for additional 2a loci, PCR-amplified fragments from exons 2, 6, and 9 of 2a5 were used to probe Southern blots Southern blot a technique for detecting specific DNA sequences following agar gel electrophoresis of a set of DNA restriction enzyme digestion fragments. The fragments after electrophoresis are transferred to a nitrocellulose or nylon membrane by applying the membrane to the gel; of the BAC clone DNAs. They hybridized to the expected fragments for all genes, which collectively accounted for all positive signals. Specific primers were used to amplify and sequence the sixth exons of 2a4, 2a5, and 2a12 and the third exons of 2a12 and 2a22 from the appropriate BAC clone DNAs (Figure 1) to confirm the locations and identities of these genes. The 2a20-ps pseudogene and the 2a12 gene were identified only from the genomic sequence assemblies, as they were not included in any BAC clones used in this study. The 2a4 and 2a5 genes are extremely similar (98% exons/96% introns), even though they are not physically close together (Figure 2). The 2a12 locus is strongly related by sequence to 2a22 but is quite different from the other genes, with only 75 and 76% exon identities to 2a4 and 2a5, respectively (Table 2). The 2a20-ps and 2a23-ps pseudogenes are very similar to each other and are slightly more similar to the 2a5 locus than to 2a12. The Cyp2a21-ps pseudogene is most similar to 2a5 (Table 2). [FIGURE 2 OMITTED] Cyp2b. The 2b subfamily is the most diverse group within the gene cluster. Four genes previously known to be functional (Nelson et al. 1996)--2b9, 2b10, 2b13, and 2b19--have been identified and localized. A fifth gene previously identified as functional, 2b20 (Damon et al. 1996), and its putative pseudogene 2b20-ps (Marc et al. 1999) are not found in the chromosome 7 gene cluster. Our repeated attempts to amplify a 2b20-specific fragment with the PCR primers designed by Damon et al. (1996), using both BAC clone and mouse genomic templates, were unsuccessful. We therefore conclude that the 2b20 and 2b20-ps transcripts are artifacts artifacts see specimen artifacts. of the very similar 2b10 gene. In addition, the 2b10 gene in both sequence assemblies differs markedly (19 base pair substitutions) from the originally reported 2b10 mRNA (Noshiro et al. 1988); this may be because of interstrain heterogeneity het·er·o·ge·ne·i·ty n. The quality or state of being heterogeneous. heterogeneity the state of being heterogeneous. or to mistakes in sequencing. We now consider all 2b10, 2b20, and 2b20-t)s mRNAs in the database to be products of the single gene identified as 2b10 in Figure 1. To determine the location of each 2b locus, PCR products generated from exons 1, 7, and 9 of 2b9 were used to probe the BAC clone Southern blots. They hybridized to the appropriate DNA fragments from each of the 2b genes and pseudogenes. Specific probes were also made for exon 4 of 2b13 and exon 3 of 2b19 that hybridized uniquely to fragments of those genes on blots. Primers specific for exon 2 of 2b10 were used to amplify and sequence a fragment of BAC clone RP23-113D13 to confirm the identity of this locus. Specific primers were also used to amplify and sequence fragments of intron Intron In split genes, a portion that is included in ribonucleic acid (RNA) transcripts but is removed from within a transcript during RNA processing and is rapidly degraded. 2 and intron 3 from the nearly identical 2626-ps and 2b27-ps pseudogenes. These fragments were then used as probes on the blots to prove the separate existence of the two pseudogenes. PCR products encoding intron 2 of 2b26-ps/2627-ps hybridized to Eco RI fragments of 10.6 and 9.5 kb, respectively, in the BAC clones RP23-430G14 and -113D13, and to both fragments in the BAC clone RP23-174D7, which overlaps both pseudogenes (Figure 1). Five of the 2b loci can confidently be identified as pseudogenes because they consist of less than the nine exons common to functional Cyp2b genes (Table 1). The partial nature of the 2b24-ps, 2b25-ps, 2626-ps, 2b27-ps, and 2b28-ps pseudogenes was confirmed by exon-specific PCRs from the appropriate BAC clones. The locus labeled 2b23 in Figure 1 has not been previously identified as a functional gene, but it has all nine exons, includes a legitimate heme signature in the ninth exon, and has no premature stop codons or frameshift mutations. Differences between the public and Celera versions of this sequence yield a few alternative amino acid amino acid (əmē`nō), any one of a class of simple organic compounds containing carbon, hydrogen, oxygen, nitrogen, and in certain cases sulfur. These compounds are the building blocks of proteins. residues but do not affect the viability of any potential product. The 2623 sequence does not match any mRNAs or ESTs currently listed in GenBank, so it must be listed as a potentially functional new 2b gene, pending a search through more tissue types for a matching mRNA. There are thus a total of five confirmed or potentially functional genes and five partial pseudogenes within the 2b subfamily in mouse. The rule that functional genes in the Cyp2 family have a nine-exon structure is violated in the mouse by the 2b10 gene. Two eDNA sequences were originally described for this gene (Noshiro et al. 1988), one with a standard length of 1,476 base pairs, and a second rare form with a stretch of 27 extra nucleotides that were presumed to belong either to the end of exon 8 or the beginning of exon 9. However, our analysis of the genomic sequence of 2b10 makes it clear that these base pairs in fact make up a small additional exon with valid splice sites (Figure 3). This "miniexon," which encodes only nine amino acids, appears to have been recruited from sequence that previously formed part of the eighth intron of the ancestral 2b gene. We have identified sequences in the eighth introns of the 2b9 and 2b13 genes that are very similar to the miniexon, but both of these introns have critical differences that prevent the formation of splice sites (Figure 3). Because the nine additional amino acids would disrupt a critical motif in the enzyme, it is likely that the long form of 2b10 represents a nonfunctional splice variant. [FIGURE 3 OMITTED] Evolutionary relationships among the paralogous mouse 2b loci are far from clear. The 2b9 and 2b13 genes are more highly related to each other than to either 2b10 or 2b19, and the new 2623 gene is somewhat more similar to 2b19 than to the other genes. Any other pairing among the functional 2b genes gives the same average identity level of about 85% across exons (Table 2). The 2b pseudogenes are also not closely related to specific functional genes except for the 2b28-/as partial pseudogene, which is somewhat more similar to 2b13 and 2b9 than to the other 2b genes (Table 2). As noted above, the 2b26-ps and 2b27-ps pseudogenes are nearly identical, but they do not show a particular affiliation to any of the functional genes. Cyp2f. There is only a single member of the 2f subfamily in the mouse, the functional 2f2 gene, which is located centromeric of and close to the 2t4 gene (Figure 1) in a position exactly corresponding to that of the human 2F1P locus. Unlike the human and gorilla (Chen et al. 2002), the mouse does not have a second 2f locus. This was established by the failure of intron 1 and exon 9 primers to amplify from any of the BAC clones and by the lack of hybridization hybridization /hy·brid·iza·tion/ (hi?brid-i-za´shun) 1. crossbreeding; the act or process of producing hybrids. 2. molecular hybridization 3. to the Southern blots using a 2fc2 exon 9 probe. Cyp2g. The 2g subfamily consists of the gene responsible for the known CYP2G 1 enzyme, located just centromeric of the 2a5 gene, and the partial pseudogene 2g1-ps, which lies just centromeric of the 2a4 gene (Figure 1). Cyp2g1-ps has only exons 7, 9, and half of 8, which collectively are about 96% identical to the corresponding portions of 2g1 (Table 2). As both sequence assemblies are very fragmented near 2g1-ps, the pseudogene sequence can currently be found only in the Celera assembly, and even there it is incomplete. Because 2g1 is also incomplete in the public assembly (Table 1), its identity was confirmed by PCR-amplifying and sequencing fragments of exons 1, 2, 6, and 9 from the BAC clone RP22-78A19. To prove the partial nature of the 2g1-ps pseudogene, the RP23-430G14 BAC clone was used as template for the same amplifications; only the exon 9 primers gave a product. In addition, the exon 1 and 6 products hybridized only to clone RP22-78A19 on the Southern blots and not to RP23-430G 14. Cyp2s. As is true for the human, the mouse has a single member of the 2s subfamily located at one end of the cluster, close to the Axl gene (Figure 1). Neither Southern blots nor PCR amplifications gave evidence of any additional 2s loci. Cyp2t. There is also only a single member of the 2t subfamily in the mouse, 2t4. It is very similar to the functional rat 2T1 gene (Nelson 2003), but the predicted 2t4 mRNA does not match any mouse eDNA or EST now in GenBank. The gene is located at the extreme telomeric end of the mouse cluster, only 8 kb from the Egln2 locus (Figure 1), in the same relative position as the human pseudogene 2T2P T2P Type-Two Phaser (Star Trek) T2P Transition to Production (computer systems development) . The mouse 2t4 is slightly more related to human 2T2P than to human 2T3P (Table 2). Discussion The Cyp2 subfamilies in the mouse cluster are the same six present in the corresponding human cluster (CYP2A, 2B, 2F, 2G, 2S, and 2T), but the total number of loci is significantly greater in mouse (22 vs. 13). This difference is due primarily to the expansion of the 2a and 2b subfamilies in mouse (Figure 2). The similarities between the CYP2A-T gene clusters in the mouse and human indicate that the six component subfamilies were already present in a common mammalian ancestor. The differences in organization indicate that most of the individual loci within the subfamilies developed after the primate and rodent rodent, member of the mammalian order Rodentia, characterized by front teeth adapted for gnawing and cheek teeth adapted for chewing. The Rodentia is by far the largest mammalian order; nearly half of all mammal species are rodents. lineages split. Some specific loci, however, may have developed in the common ancestor, and therefore may be truly orthologous. To trace the evolution of the gene clusters in any detail, it is necessary to distinguish these older orthologous loci from newer, species-specific loci. Defining orthologs between mouse and human also facilitates the creation of appropriate knockout animals. It is often difficult or impossible to identify orthologs of CYP genes in all but very closely related species (Nelson et al. 1993), but when sequence similarity, physical location, and protein function all match, this can be done at least tentatively (Chen et al. 2002; Hoffman et al. 2001). In the case of the CYP2A-Tclusters, some orthologous relationships can be reasonably deduced. The mouse Cyp2a5 locus may be a true ortholog of the human CYP2A6, as they have the most similar sequences (Table 2) and they are located at similar positions within the two clusters (Figure 2A). The single mouse 2f2 and the functional human 2F1 both express proteins with similar substrate ranges and the same limited tissue distribution and are thus considered orthologous (Chen et al. 2002). The mRNAs known from the single mouse 2s1 and human 2S1 genes are 81% identical, and these genes are similarly located on the AXL ends of their respective clusters, so they can also be considered orthologous (Hoffman et al. 2001). The mouse 2g1 is equally similar to the two human 2G pseudogenes (83%), which are probably degraded copies of an earlier functional gene that was orthologous to 2g1. Finally, the presumably pre·sum·a·ble adj. That can be presumed or taken for granted; reasonable as a supposition: presumable causes of the disaster. functional mouse 2t4 has the same position on the EGLN2 end of the cluster as does the human 2T2P and is likely to be its ortholog (Nelson et al. 2003). Though the two ends of the mouse and human cluster are very similar, with orthologous genes in corresponding positions, the distinct organization in the middle of each cluster indicates that some major rearrangements have occurred since the two species diverged from a common ancestor. In the human, a large inverted repeat inverted repeat blocks of nucleotide sequence that are present in more than one copy, but in a reverse order, such as ABCDE and E,D,C,B,A,; they may be terminal or internal. Called also indirect repeat. See also palindrome. was inferred to explain the mirror-image organization underlying the paired 2F, 2T, 214, and 2G loci in the center of the cluster (Hoffman et al. 1995, 2001). In the mouse, there is no such mirror-image set of loci. Instead, there are single 2land 2t genes, while the central loci are arranged a-g-b-a-g-b rather than a-g-b-b-g-a as in humans, suggesting a more limited tandem duplication without an inversion (Figure 2A). Additionally, in humans the 2B6 and 2B7P loci were apparently inserted into the middle of the 2A18P locus (Hoffman et al. 2001), whereas in mouse the many 2b subfamily genes occur in two separate groups, with no sign of a late insertion. Figure 2A illustrates our hypothesis that the basic organization of the central loci in mouse is due to a large tandem duplication encompassing 2a, 2g, and perhaps 2b loci. This hypothesis is supported by the fact that the 2a5 and 2gl genes on the telomeric side of the cluster are transcribed in the same direction and are spaced a similar distance apart as are the 2a4 and 2g1-ps loci on the centromeric side. It is also consistent with the suggestion of Aida et al. (1994) that because Mus musculus has both 2a4 and 2a5 genes, whereas its close relative Mus spretus appears to have two nearly identical 2a5-like loci, 2a4 must have been formed recently by the alteration of a critical residue in a previously duplicated copy of the 2a5 gene. This large tandem duplication may or may not have included loci from the 2b subfamily. Though 2a4 is very similar in sequence to 2a5, and 2g1 to 2g1-ps, there are no highly related 2b genes across the two groups, as would be expected if an a-g-b block of sequence had been recently duplicated. In addition, the 2b loci are not as clearly patterned as the 2a and 2g loci-multiple 2b genes are transcribed in different directions next to both the 2gl and 2g1-ps loci. Conversely, the 2b loci do occur in two distinct groups that are in similar positions relative to the 2a and 2g loci. Although there is thus good evidence for a tandem duplication that included at least one 2a and one 2g locus, the timing and the extent of this duplication cannot be determined until the corresponding genes are examined in additional mammalian species. Full or partial deletions of single loci may have occurred in both the primate and rodent lineages, but these cannot be detected. Gene losses by deletion should always be harder to distinguish than duplications, as they are unlikely to leave behind any characteristic pattern or signature sequences. In particular, partial pseudogenes in both clusters may have been created either by a duplication followed by deletion of some exons or by a duplication encompassing only part of a locus. On a smaller scale, several interesting comparisons can be made among the numerous 2a and 2b subfamily genes. The 2a subfamily expanded in the mouse by a series of duplications involving single and multiple loci. The group of six 2a loci on the telomeric side of the cluster includes three highly related pairs (Table 2). The whole 35-kb block of DNA that includes Cyp2a22 and 2a23-ps is highly similar to the block containing Cyp2a12 and 2a20-ps, with more than 90% identity between the regions around the genes. The 20-kb region that encompasses Cyp2a5 shares more than 90% sequence identity with the region around Cyp2a21-ps. The simplest explanation for this arrangement, shown in Figure 2B, requires several rounds of duplication. The exact order in which the duplications happened is ambiguous, as there is no significant patterning of sequence in between the duplicated blocks. The two strongest similarities within the 2b subfamily are between the 269 and 2b13 loci and between the 2626-ps and 2627-ps pseudogenes (Table 2). The relative positions and the directions of transcription of these gone pairs suggest that a second, smaller tandem duplication occurred within the centromeric 2b group to create the 269-2626-ps and 2b13-2627-ps regions, as shown in Figure 2C. Evidence for this duplication also comes from the extremely high level of identity (99%) found between short noncoding sequences in the introns of the 2626-ps and 2627-ps pseudogenes (data not shown). The information provided by this study has allowed us to draw a complete and accurate picture of the Cyp2a-t gene cluster in the mouse and to understand the similarities and differences between its evolution and that of the human cluster. This comparison should enable researchers to better utilize the mouse as a model system for the study of these CYP genes in humans and in other mammals. Table 1. Table of Cyp2a-t subfamily cluster genes in mouse. Locus Orientation Location (a,b) 2t4 Cen [right arrow] Tel 310 157707-161230 2f2 Cen [right arrow] Tel 310 124085-136041 2a20-ps Tel [right arrow] Cen 310 52619-53192 2a12 Cen [right arrow] Tel 310 33243-40784 2a21-ps Cen [right arrow] Tel 308 17546-20076 2a22 Tel [right arrow] Cen 308 1879-9066 2a23-ps Cen [right arrow] Tel NW_011833 5827-6402 2a5 Cen [right arrow] Tel - 2g1 Cen [right arrow] Tel 307 763765-775566 2b19 Cen [right arrow] Tel 307 711178-725301 2b24-ps Cen [right arrow] Tel 307 692575-699876 2b23 Tel [right arrow] Cen 307 618973-640139 2a4 Cen [right arrow] Tel 307 266450-274118 2g1-ps Cen [right arrow] Tel - 2b25-ps Tel [right arrow] Cen 307 195792-195980 2b9 Cen [right arrow] Tel 307 144000-171613 2b26-ps Tel [right arrow] Cen AC 157 22100-6200 2b13 Cen [right arrow] Tel 3071-32300 2b27-ps Tel [right arrow] Cen 303 2122792-2130037 2b28-ps Cen [right arrow] Tel 303 2064442-2094900 2b10 Cen [right arrow] Tel 303 2012844-2040458 2s1 Tel [right arrow] Cen 303 1917585-1931046 Locus Exons mRNA 2t4 1-9 2f2 1-9 NM_007817 2a20-ps 1-2 2a12 1-9 NM_133657 2a21-ps 3-9 2a22 1-9 2a23-ps 1-2 2a5 1-9 NM_007812 2g1 1-9 NM_013809 2b19 1-9 NM_007814 2b24-ps 7-9 2b23 1-9 2a4 1-9 NM_009997 2g1-ps 7-9 2b25-ps 9 2b9 1-9 NM_010000 2b26-ps 2-6 2b13 1-9 NM_007813 2b27-ps 2-9 2b28-ps 1-6 2b10 1-10 NM_009998 2s1 1-9 NM_028775 Locus Gap (c) 2t4 2f2 2a20-ps 2a12 2a21-ps Exons 3, 4, 8, 9 2a22 2a23-ps 2a5 All exons 2g1 Exons 8, 9 2b19 2b24-ps 2b23 2a4 2g1-ps Exons 7-9 2b25-ps 2b9 Exon 1 2b26-ps Exons 2, 3, 6 2b13 Exon 1 2b27-ps Exons 2-4, 9 2b28-ps 2b10 2s1 Abbreviations: Cen, centromere; Tel, telomere. (a) GenBank supercontigs NW_000303, NW 000307, and NW 000310 are abbreviated 303, 307, and 310. Contig AC087157 is abbreviated AC157. (b) Locations are base pair numbers within supercontigs that span the first to last codon of a locus. Several loci are found only in BAC clone sequence (GenBank no. AC087157) and not in supercontigs. (c) Gap column shows parts of the coding sequences not included in the GenBank sequence assemblies. Table 2. Comparison of Cyp2a and Cyp2b subfamily coding sequences among related mouse loci and selected human loci. (a) Locus 2a5 2a 12 2a4 98% (1,485) 75% (1,491) 2a5 -- 76% (1,485) 2a12 -- -- 2a22 -- 96% (1,506) 2a21-ps 97% (1,066) -- 2a23-ps 97% (576) -- Locus 2b10 2b13 2b9 86% (1,503) 92% (1,473) 2b10 -- 85% (1,503) 2b13 -- -- 2b19 -- -- Locus 2b24-ps (Exons 7-9) 2b25-ps (Exon 9) 2b9 87% (516) 80% (188) 2b10 86% (509) 82% (184) 2b13 86% (505) 81% (186) 2b19 86% (470) 80% (183) 2623 86% (505) 85% (182) Locus 2b28-ps (Exons 1-4) CYP2B6 (b) 2b9 92% (664) 75% (1,476) 2b10 83% (649) 76% (1,503) 2b13 94% (656) 75% (1,476) 2b19 82% (632) 77% (1,479) 2b23 82% (668) 77% (1,470) Locus 2a20-ps (Exons 1, 2) CYP2A6 (b) 2a4 72% (345) 83% (1,485) 2a5 73% (334) 84% (1,485) 2a12 72% (162) 75% (1,485) 2a22 -- -- 2a21-ps -- -- 2a23-ps -- -- Locus 2b19 2b23 2b9 84% (1,479) 86% (1,476) 2b10 85% (1,506) 87% (1,476) 2b13 84% (1,479) 86% (1,462) 2b19 -- 88% (1,425) Locus 2b26-ps (Exons 4, 5) 2627-ps(Exons 5-8) 2b9 84% (333) 76% (651) 2b10 85% (334) 78% (656) 2b13 80% (334) 76% (655) 2b19 84% (335) 78% (652 2623 82% (334) 81% (506) Locus 2b9 2b10 2b13 2b19 2b23 (a) Numbers shown are percent identical nucleotides and total nucleotides compared between coding sequences (in parentheses). (b) CYP2A6 and CYP2B6 are representative genes from the same subfamilies in humans. We thank K. Gavit, L. Kaplan, A. Parent, M. Smith, S. Whitehead, and E. Workman for laboratory assistance. We also thank D. Nelson (University of Tennessee The University of Tennessee (UT), sometimes called the University of Tennessee at Knoxville (UT Knoxville or UTK), is the flagship institution of the statewide land-grant University of Tennessee public university system in the American state of Tennessee. , Memphis) and D. Nebert (University of Cincinnati The University of Cincinnati is a coeducational public research university in Cincinnati, Ohio. Ranked as one of America’s top 25 public research universities and in the top 50 of all American research universities,[2] ) for helpful discussions, and J. Vaughn, K. Killian, and D. Pennock (Miami University Miami University, main campus at Oxford, Ohio; coeducational; state supported; chartered 1809, opened 1824. The library has extensive collections in literature and American history, including the William Holmes McGuffey Library and Museum and the Edgar W. ) for reviewing the manuscript. This work was supported by National Institutes of Health grant (NIH) 1R15GM55951 (SMGH), and Medical Research Service, Department of Veterans Affairs Veterans Affairs is a term of the business that deals with the relation between a government and its veteran communities, usually administered by the designated government agency. , and NIH grants R01 AR45603, P30 AR41943, and P30 ES00267 (DSK DSK Disk DSK Dominique Strauss-Kahn (French Finance Minister 1997-99) DSK Deutsche Steinkohle AG DSK DSP Starter Kit (Texas Instruments) DSK Downstream Keyer DSK Dvorak Standard Keyboard ). The authors declare they have no conflict of interest. * The online version of this article (available at http://www.ehponline.org) contains Supplemental Material, Tables 1-4. REFERENCES Aida K, Moore R, Negishi M. 1994. Lack of the steroid 15[alpha]-hydroxylase gene (Cyp2a-4)in wild mouse strain Mus spretus: rapid evolution of the P450 gone superfamily. Genomics 19:564-566. Chen N, Whitehead SE, Caillat AW, Gavit K, Isphording DR, Kovacevic D, et al. 2002. Identification and cross-species comparisons of CYP2F subfamily genes in mammals. Mutat Res 499:151-161. Damon M, Fautrel A, Marc N, Guillouzo A, Corcos L. 1996. Isolation of a new mouse cDNA clone: hybrid form of cytochrome p450 2b10 and NAPDH-cytochrome p450 oxidoreductase oxidoreductase /ox·i·do·re·duc·tase/ (ok?si-do-re-duk´tas) any of a class of enzymes that catalyze the reversible transfer of electrons from a substrate that becomes oxidized to one that becomes reduced (oxidation-reduction, or redox . Biochem Biophys Res Commun 226:900-905. Kim J, Gordon L, Dehal P, Badri H, Christensen M, Groza M, et al. 2001. Homology-driven assembly of a sequence-ready mouse BAC contig map spanning regions related to the 46-Mb gone-rich euchromatic segments of human chromosome 19. Genomics 74:129-141. Eichler EE. 1998. Masquerading 1. (networking) masquerading - "NAT" (Linux kernel name). 2. (messaging) masquerading - Hiding the names of internal e-mail client and gateway machines from the outside world by rewriting the "From" address and other headers as the message leaves the repeats: paralogous pitfalls of the human genome The human genome is the genome of Homo sapiens, which is composed of 24 distinct pairs of chromosomes (22 autosomal + X + Y) with a total of approximately 3 billion DNA base pairs containing an estimated 20,000–25,000 genes. . Genome Res 8:758-762. Fernandez-Salguero P, Hoffman S, Cholerton S, Mohrenweiser H, Raunio H, Rautio A, et al. 1995. A genetic polymorphism polymorphism, of minerals, property of crystallizing in two or more distinct forms. Calcium carbonate is dimorphous (two forms), crystallizing as calcite or aragonite. Titanium dioxide is trimorphous; its three forms are brookite, anatase (or octahedrite), and rutile. in coumarin coumarin /cou·ma·rin/ (koo´mah-rin) 1. a principle extracted from the tonka bean; it contains a factor, dicumarol, that inhibits hepatic synthesis of vitamin K–dependent coagulation factors, and a number of its derivatives are 7-hydroxylation: sequence of the human CYP2A genes and identification of variant CYP2A6 alleles. Am J Hum Genet genet: see civet. 57:651-660. Gonzalez F, Nebert D. 1990. Evolution of the P450 gone superfamily: animal-plant "warfare," molecular drive and human genetic differences in drug oxidation. Trend Genet 6:182-186. Hoffman S, Fernandez-Salguero P, Gonzalez F, Mohrenweiser H. 1995. Organization and evolution of the cytochrome P450 CYP2A-2B-2Fsubfamily gone cluster on human chromosome 19. J Mol Evol 41:894-900. Hoffman S, Keeney D. 2002. Fine-scale mapping of CYP gone clusters: an example from the human C YP4 family. Methods Enzymol 357:36-44. Hoffman S, Nelson D, Keeney D. 2001, Organization, structure and evolution of the CYP2 gone cluster on human chromosome 19. Pharmacogenetics Pharmacogenetics Definition Pharmacogenetics is the study of how the actions of and reactions to drugs vary with the patient's genes. Description 11:687-698. Iwasaki M, Lindberg R, Juvonen R, Negishi M 1993. Site-directed mutagenesis Site-directed mutagenesis is a molecular biology technique in which a mutation is created at a defined site in a DNA molecule, usually a circular molecule known as a plasmid. In general, site-directed mutagenesis requires that the wild-type gene sequence be known. of mouse steroid 70-hydroxylase (cytochrome P-450 7(z): role of residue-209 in determining steroid-cytochrome P-450 interaction. Biochem J 291:569-573. Lewis DF, Watson E, Lake BG. 1998. Evolution of the cytochrome P450 superfamily: sequence alignments and pharmacogenetics. Mutat Res 410:245-270. Marc N, Damon M, Fautrel A, Guillouzo A, Corcos L. 1999. Isolation of a cyp2b10-1ike cDNA and of a clone derived from a cyp2bl0-1ike pseudogene. Biochem Biophys Res Commun 258:11-16. Nebert DW, Russell DW. 2002. Clinical importance of the cytochromes P450. Lancet 360:1155-1162. Nelson D. 2003. Cytochrome P450 Homepage. Memphis, TN:University of Tennessee. Available: http://drnelson.utmem.edu/CytochromeP450.html [accessed 19 June 2003]. Nelson D, Kamataki T, Waxman D, Guengerich F, Esterbrook R, Feyereisen R, et al. 1993. The P450 superfamily: update on new sequences, gone mapping, accession numbers, early trivial names triv·i·al name n. 1. A common, historic, or convenient name for a substance, derived often from the source in which the substance was discovered, but unsystematic and not used in modern official nomenclature, as aspirin for of enzymes, and nomenclature. DNA Cell Biol 12:1-51. Nelson D, Koymans L, Kamataki T, Stegeman J, Feyereisen R, Waxman D, et al. 1996. Cytochrome P450 superfamily: update on new sequences, gone mapping, accession numbers, and nomenclature. Pharmacogenetics 6:1-42. Noshiro M, Lakso M, Kawajiri K, Negishi M. 1988. Rip locus: regulation of female-specific isozyme isozyme /iso·zyme/ (i´so-zim) one of the multiple forms in which an enzyme may exist in an organism or in different species, the various forms differing chemically, physically, or immunologically, but catalyzing the same reaction. (I-P-450 16[alpha]) of testosterone testosterone (tĕstŏs`tərōn), principal androgen, or male sex hormone. One of the group of compounds known as anabolic steroids, testosterone is secreted by the testes (see testis) but is also synthesized in small quantities in the 16[alpha]-hydroxylase in mouse liver, chromosome localization Customizing software and documentation for a particular country. It includes the translation of menus and messages into the native spoken language as well as changes in the user interface to accommodate different alphabets and culture. See internationalization and l10n. , and cloning of P-450 cDNA. Biochemistry 27:6434-6443. Waterson RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, et al. 2002. Initial sequencing and comparative analysis of the mouse genome. Nature 420:520-562. Received 25 June 2003; accepted 24 September 2003. Haoyi Wang, (1) Kyle M. Donley, (1) Diane S. Keeney, (2) and Susan M.G. Hoffman (1) (1) Department of Zoology zoology, branch of biology concerned with the study of animal life. From earliest times animals have been vitally important to man; cave art demonstrates the practical and mystical significance animals held for prehistoric man. , Miami University, Oxford, Ohio Oxford is a college town located in the southwestern portion of the U.S. state of Ohio in northwestern Butler County in Oxford Township, originally called the College Township. The population was 21,943 at the 2000 census (approximately 16,000 students are included in this figure). , USA; (2) VA Tennessee Valley The Tennessee Valley is the drainage basin of the Tennessee River and is largely within the U.S. state of Tennessee. It stretches from southwest Kentucky to northwest Georgia and from northeast Mississippi to the mountains of Virginia and North Carolina. Healthcare System Nashville and the Departments of Medicine/Dermatology and Biochemistry, Vanderbilt University Vanderbilt University, at Nashville, Tenn.; coeducational; chartered 1872 as Central Univ. of Methodist Episcopal Church, founded and renamed 1873, opened 1875 through a gift from Cornelius Vanderbilt. Until 1914 it operated under the auspices of the Methodist Church. School of Medicine, Nashville, Tennessee “Nashville” redirects here. For other uses, see Nashville (disambiguation). Nashville is the capital and the second most populous city of the U.S. state of Tennessee, after Memphis. , USA Address correspondence to S. Hoffman, Dept. of Zoology, Miami University, 700 East High St., Oxford, OH 45(/56 USA. Telephone: (513) 529-3125. Fax: (513) 529-6900. E-mail: hoffmasm@muohio.edu |
|
||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion