Printer Friendly
The Free Library
18,914,768 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

From point B to point A: applying toxicogenomics to biological inference.


The data points in an integrated toxicogenomics experiment with microarray, proteomics, and metabolomics data are virtually innumerable. With thousands--or tens of thousands--of data points for each sample in each type of analysis, the complexity and the sheer amount of data multiply fast. As teams of statisticians Statisticians or people who made notable contributions to the theories of statistics, or related aspects of probability, or machine learning: A to E
  • Odd Olai Aalen (1947–)
  • Gottfried Achenwall (1719–1772)
  • Abraham Manie Adelstein (1916–1992)
, bioinformaticists, and biologists work to interpret this complexity, they must also ensure that each data point is valid and can be integrated with other data from the same or different types of experiments. Underlying this detailed exercise are two expansive goals. One is to identify markers of toxic exposures or disease. The other is to understand the biological processes underlying disease. The latter is called biological inference--the highly iterative process of inferring cause-and-effect relationships from toxicogenomics data, using computation efforts linked to mathematics.

Efforts to identify markers of exposure are concerned primarily with discerning patterns in output from microarray, proteomics, and metabolomics technology. These patterns can be characterized as molecular fingerprints and can be extremely useful in diagnosing levels of exposure, even though researchers may not understand exactly why particular patterns appear. In contrast, biological inference is concerned with an understanding of how patterns in genomics data actually translate into the details of gene transcription Gene transcription
The process by which genetic information is copied from DNA to RNA, resulting in a specific protein formation.

Mentioned in: Gene Therapy
, protein creation, and metabolism.

By studying associations among the expression of genes, proteins, and metabolites Metabolites
Substances produced by metabolism or by a metabolic process.

Mentioned in: Interactions
, researchers try to identify genes of influence, many of which act as hubs of metabolic networks, affecting many other genes. Transient hubs, those that act briefly as a cell changes state, can be especially difficult to find and analyze. A goal of special importance to toxicogenomics is to distinguish endogenous pathways--those involved in the cell's normal chores of metabolism and reproduction--from exogenous pathways triggered by exposures to drugs or toxicants. The ultimate goal is to follow a pathway from the expression of genes through the creation and modification of proteins and metabolites, as well as all the associated gene-gene, protein--protein, and metabolite-metabolite interactions in between.

Anchor Management

Inferring biological pathways requires research teams to mine and interpret vast quantities of genomics data. Interpretation strategies include grouping or clustering data to find patterns as well as use of statistical methods to filter data on genes with the strongest signals or those expressed in concert. One key technique in biological inference is phenotypic anchoring, using known biological information to interpret signals, or uncharacterized data, from genomics experiments. These signals indicate the presence of molecules (such as mRNA or proteins in a given range of molecular weights) and can take various forms, depending on the type of technology used. For example, in microarray experiments signals take the form of fluorescence generated by the bonding of strands of mRNA to the microarray slide. In some studies, these genomics data are compared to data from traditional toxicology tests performed on the same samples. In others, the team integrates what is already known or suspected about biological pathways based on past studies.

Examples of research using phenotypic anchoring can be found in acetaminophen acetaminophen (əsēt'əmĭn`əfĭn), an analgesic and fever-reducing medicine similar in effect to aspirin. It is an active ingredient in many over-the-counter medicines, including Tylenol and Midol.  studies sponsored by the NIEHS NIEHS National Institute of Environmental Health Sciences (NIH, DHHS)  National Center for Toxicogenomics (NCT NCT National Childbirth Trust
NCT National Car Test
NCT North Carolina Theatre
NCT National Coordination Team
NCT Northern California TRACON
NCT Noise Cancellation Technology
NCT Network Control and Timing
NCT Nicotine Replacement Therapy
). In this research, microarray data were compared to those from traditional toxicology tests, which both aided in interpretation of the microarray data and led to the development of new knowledge and understanding about the toxicity of this commonly used drug.

For example, in one study published in the July 2004 issue of Toxicological Sciences, data from microarrays and traditional toxicity tests confirmed previous results from other labs showing that toxic doses of acetaminophen deplete de·plete
v.
1. To use up something, such as a nutrient.

2. To empty something out, as the body of electrolytes.
 adenosine adenosine /aden·o·sine/ (ah-den´o-sen) a purine nucleoside consisting of adenine and ribose; a component of RNA. It is also a cardiac depressant and vasodilator used as an antiarrhythmic and as an adjunct in myocardial perfusion imaging  triphosphate triphosphate /tri·phos·phate/ (tri-fos´fat) a salt containing three phosphate radicals.

tri·phos·phate
n.
A salt or ester containing three phosphate groups.
 (ATP ATP: see adenosine triphosphate.
ATP
 in full adenosine triphosphate

Organic compound, substrate in many enzyme-catalyzed reactions (see catalysis) in the cells of animals, plants, and microorganisms.
; molecules that store cellular energy) and damage mitochondria, the organelles that produce ATP. In addition, microarray data revealed other exposure effects that traditional tests had missed. For example, liver cells begin to express genes consistent with cellular energy loss at doses too low to cause the kind of cell damage that can be detected by histopathology his·to·pa·thol·o·gy
n.
The science concerned with the cytologic and histologic structure of abnormal or diseased tissue.


Histopathology
The study of diseased tissues at a minute (microscopic) level.
 and other traditional methods.

The microarray data also provided information on a possible new signature of acetaminophen toxicity involving the metal-lothionein gene and several others, which may be involved in the liver's antioxidant antioxidant, substance that prevents or slows the breakdown of another substance by oxygen. Synthetic and natural antioxidants are used to slow the deterioration of gasoline and rubber, and such antioxidants as vitamin C (ascorbic acid), butylated hydroxytoluene  defense system. "We didn't previously know that those genes were involved in acetaminophen toxicity, but it fits into the biological story of cell defense mechanisms," says Alexandra Heinloth, a research scientist with the NCT and lead author of the 2004 paper.

A different type of phenotypic anchoring was used in a study of pathways linked to inflammation. Microarray experiments with mouse strains that exhibited both high and low levels of response to inhaled lipopolysaccharide lipopolysaccharide /lipo·poly·sac·cha·ride/ (-pol?e-sak´ah-rid)
1. a molecule in which lipids and polysaccharides are linked.

2.
 compounds (which trigger immune responses) identified about 500 genes that were responsive in at least one of the strains. Researchers from Duke University, The Institute for Genomic Research, and George Washington University George Washington University, at Washington, D.C.; coeducational; chartered 1821 as Columbian College (one of the first nonsectarian colleges), opened 1822, became a university in 1873, renamed 1904. , including John Quackenbush, now a professor in biostatistics and computational biology Not to be confused with Biologically-inspired computing.
Computational biology is an interdisciplinary field that applies the techniques of computer science, applied mathematics, and statistics to address problems inspired by biology.
 at the DanaFarber Cancer Institute and the Harvard School of Public Health The Harvard School of Public Health is (colloquially, HSPH) is one of the professional graduate schools of Harvard University. Located in Longwood Area of the Boston, Massachusetts neighborhood of Mission Hill, next to Harvard Medical School and Cambridge, Massachusetts, , used two independent methods to filter the results of these microarray experiments, to prioritize genes for future study.

In the first method, the team identified 30 genes whose expression levels best distinguished the low- and high-responding mice. In the second method, they used quantitative trait locus Inheritance of quantitative traits refers to the inheritance of a phenotypic characteristic that varies in degree and can be attributed to the interactions between two or more genes and their environment (also called Polygenic inheritance).  (QTL QTL Quantitative Trait Loci
QTL Qualified Thrift Lender
QTL Qualcomm Technology Licensing
QTL Quality Teaching and Learning (Centers for Quality Teaching and Learning; Raleigh, NC)
QTL True Heading (radiotelegraphy) 
) analysis to find regions genetically linked to the strength of the lipopolysaccharide-induced response. When the researchers compared their 500-gene list to the QTL regions, they found a set of 28 that were both differentially expressed and genetically linked to the observed phenotypes. There was no overlap among the genes identified by these two methods.

In a report of the study published in the June 2004 issue of Genomics, the researchers acknowledge that they may miss genes with important roles with these filtering methods. They argue, however, that their approaches provided an objective way to obtain a small number of high-priority genes for future functional studies.

Beyond Microarrays

Although researchers aim to eventually link pathways from expression to metabolism, genomics research thus far has focused on microarrays because this technology is more standardized and far more widely available than methods for analyzing proteins (primarily mass spectrometry mass spectrometry
 or mass spectroscopy

Analytic technique by which chemical substances are identified by sorting gaseous ions by mass using electric and magnetic fields.
) and metabolites (mass spectrometry and nuclear magnetic resonance nuclear magnetic resonance: see magnetic resonance.
nuclear magnetic resonance (NMR)

Selective absorption of very high-frequency radio waves by certain atomic nuclei subjected to a strong stationary magnetic field.
). Although array-like assays for proteins have been developed, some using antibodies as tags, these technologies are still relatively exploratory and limited in scope, says Terry Speed, a professor in the Department of Statistics at the University of California, Berkeley The University of California, Berkeley is a public research university located in Berkeley, California, United States. Commonly referred to as UC Berkeley, Berkeley and Cal .

But microarray data have serious limitations when it comes to biological inference. Although they can show associations, such data can rarely indicate cause and effect. "What people can find across many arrays are patterns suggesting coregulation," says Speed. "If you look across hundreds of arrays and find that expression of two genes [moves] up and down together, that's highly suggestive of suggestive of Decision making adjective Referring to a pattern by LM or imaging, that the interpreter associates with a particular–usually malignant lesion. See Aunt Millie approach, Defensive medicine.  interactive behavior, It's not so much biological modeling as it is finding associations that are suggestive of biological interactions." One of the greatest limitations of microarray data reflects the underlying biology: the expression of mRNAs doesn't always translate into proteins because silencing RNAs and other mechanisms can block the translation process.

One way to bridge the gap between gene expression and protein creation is to assess the proteins in a cell through proteomics analysis. Another is to gain a better understanding of the "transcriptome The transcriptome is the set of all messenger RNA (mRNA) molecules, or "transcripts", produced in one or a population of cells. The term can be applied to the total set of transcripts in a given organism, or to the specific subset of transcripts present in a particular cell type. " (also called the "RNAome")--the expression of all regulatory elements operating to regulate the expression, stability, and translation of transcripts (strands of RNA RNA: see nucleic acid.
RNA
 in full ribonucleic acid

One of the two main types of nucleic acid (the other being DNA), which functions in cellular protein synthesis in all living cells and replaces DNA as the carrier of genetic
) in the cell. "To make genotype-phenotype correlations, you need to have a complete catalogue of transcripts that are expressed at each locus where such genotype-phenotype correlations are to be made," says Thomas Gingeras, vice president of biological science at Affymetrix.

Gingeras and other researchers at Affymetrix and the National Cancer Institute have studied the RNAome with microarrays containing probes for the entire nonrepetitive sequences, not just the coding regions, of 10 human chromosomes. Data from these arrays have demonstrated that RNA activity is extraordinarily varied and complex. Although researchers have been able to identify the roles of many sequences, such as ribosomal and protein-coding RNAs, they have had to classify a substantial number of the newly discovered transcripts as TUFs (transcripts of unknown function).

Sequences for TUFs are equally complicated. "Curiously enough, many of these transcripts are sitting in the middle of genes, overlapped on both the sense and antisense strands of coding sequences," says Gingeras. The sense, or template, strand of DNA DNA: see nucleic acid.
DNA
 or deoxyribonucleic acid

One of two types of nucleic acid (the other is RNA); a complex organic compound found in all living cells and many viruses. It is the chemical substance of genes.
 is the one that is copied or transcripted. The authors speculate that the RNAs correlating to antisense strands may be cRNA copies, created in somewhat the same way as the cDNA copies of RNA used in microarrays.

Another challenge in all types of genomics experiments is detecting signals from molecules expressed at low levels. Quantities of mRNAs and proteins in a sample can vary by a ratio of 1 million to 1. Some of these low-expression molecules could be critical triggers to biological cascades, but may be lost in the signal-to-noise ratio The ratio of the power or volume (amplitude) of a signal to the amount of unwanted interference (the noise) that has mixed in with it. Measured in decibels, signal-to-noise ratio (SNR or S/N) measures the clarity of the signal in a circuit or a wired or wireless transmission channel. .

Proteomics technologies are making progress in detecting low-expression molecules through more sophisticated sorting technologies such as SELDI-TOF SELDI-TOF Surface-Enhanced Laser Desorption/Ionization Time-Of-Flight  (a method that selects only a subset of proteins from a given sample for analysis). To detect low-expression transcripts and quantify the number of mRNAs in a given sample, researchers studying gene expression turn to methods such as RT-PCR RT-PCR

reverse transcriptase-polymerase chain reaction. See PCR1.
 (which can involve the use of controls and fluorescent markers to quantify the amount of a molecule produced during polymerase chain reaction polymerase chain reaction (pŏl`ĭmərās') (PCR), laboratory process in which a particular DNA segment from a mixture of DNA chains is rapidly replicated, producing a large, readily analyzed sample of a piece of DNA; the process is ) and SAGE (which involves marking each transcript with a unique tag and then linking and sequencing the combined transcripts to count the number of times each tag occurs).

Analysis Issues

As great as the challenges are in developing technology to detect and identify transcripts, proteins, and metabolites, the difficulties in analyzing the resulting data may be even greater. As research teams plan their experiments, they must chose from a bewildering be·wil·der  
tr.v. be·wil·dered, be·wil·der·ing, be·wil·ders
1. To confuse or befuddle, especially with numerous conflicting situations, objects, or statements. See Synonyms at puzzle.

2.
 and ever-changing assortment of statistical methods for data analysis. Speed says no one protocol will work for all experiments: "Usually there will be one method that will be preferred and often several that will be acceptable, All of them have their strengths and weaknesses."

Part of the difficulty relates to the current nature of genomics experiments. Whereas traditional statistics methods are based on the assumption that a study will have far more samples than data points per sample, genomics experiments usually involve the inverse situation: a few dozen samples, with tens of thousands of data points per sample. New statistical methods are being developed to deal with the peculiarities of genomics data. Simultaneously, other teams are working to ensure that the data to be analyzed are valid by addressing issues such as differences in experimental technologies and laboratory procedures, and revisiting the effects of sample size.

Recent studies have shown that increased standardization of microarray platforms has greatly reduced the influence of platform type on results. In a study published in the May 2005 issue of Nature Methods, Quackenbush and colleagues found that differences in microarray platforms (oligonucleotide versus spotted cDNA) did indeed affect results. However, these differences were eclipsed by a very high correlation between the platforms in expression changes caused by varying exposures to angiotensin II angiotensin II
n.
An octapeptide that is a potent vasopressor and a powerful stimulus for production and release of aldosterone from the adrenal cortex.
, a potent peptide that causes blood vessels Blood vessels

Tubular channels for blood transport, of which there are three principal types: arteries, capillaries, and veins. Only the larger arteries and veins in the body bear distinct names.
 to constrict con·strict
v.
To make smaller or narrower, especially by binding or squeezing.
. "The question we asked was, does the biology or the platform dominate?" says Quackenbush. "For more than ninety percent of genes for which we could make a reasonable comparison, we found that biology dominated platform."

The bad news is that variations in protocols (for RNA labeling, hybridization hybridization /hy·brid·iza·tion/ (hi?brid-i-za´shun)
1. crossbreeding; the act or process of producing hybrids.

2. molecular hybridization

3.
, and microarray processing), statistical methods for data acquisition and normalization In relational database management, a process that breaks down data into record groups for efficient processing. There are six stages. By the third stage (third normal form), data are identified only by the key field in their record. , and other "lab effects" can still significantly impact microarray data. This was the finding of two other studies also published in the May 2005 issue of Nature Methods, one led by Rafael Irizarry, an associate professor in the Department of Biostatistics at The Johns Hopkins University Johns Hopkins University, mainly at Baltimore, Md. Johns Hopkins in 1867 had a group of his associates incorporated as the trustees of a university and a hospital, endowing each with $3.5 million. Daniel C. , and another by researchers with the NIEHS Toxicogenomics Research Consortium (TRC TRC
Noun

(in South Africa) Truth and Reconciliation Commission: a commission which encourages people who committed human rights abuses or acts of terror during the apartheid era to reveal the truth about their crimes in return for immunity from prosecution
). Both studies compared the results of microarray analysis of identical samples performed at multiple laboratories, and both found that, with care, results can be comparable across labs. However, "you have to pay close attention to how you do things. You have to standardize your protocols from lab to lab," says Katherine Kerr, a coauthor of the TRC study and the director of the Bioinformatics and Biostatistics Facility Core at the University of Washington-NIEHS Center for Ecogenetics and Environmental Health.

The design for most microarray experiments calls for three to five samples per treatment condition. However, some statisticians say that sample size must be increased to generate the statistical power necessary to infer biological pathways. "Sample sizes in most of the current systems biology Systems biology, a field of study in the biosciences, focuses on the systematic study of complex interactions in biological systems. Particularly from 2000 onwards, the term is used widely in the biosciences, and in a variety of contexts.  experiments are not adequate to infer the kinds of complex networks that are the goal of such studies," says Gary Churchill, a staff scientist at the nonprofit Jackson Laboratory in Bar Harbor, Maine Bar Harbor, Maine, may refer to:
  • Bar Harbor (town), Maine
  • Bar Harbor (CDP), Maine, a census-designated place within the town of Bar Harbor
.

Churchill is a cofounder co·found  
tr.v. co·found·ed, co·found·ing, co·founds
To establish or found in concert with another or others.



co·found
 of the Collaborative Cross, a project to develop a panel of 1,000 new and genetically diverse mouse strains. The mice are descendants of just eight parent strains, minimizing the need for genotyping, and are being bred for maximum generic variation, allowing for a plethora of diverse yet controlled strains. Not all studies will require 1,000 mice, Churchill says, although some will. The resource is being generated to cover a wide range of needs.

Interpretation Is Key

Before teams can begin analyzing data, they must be confident that they are interpreting the raw signals accurately. For microarray data, this involves summarizing the fluorescence data for each spot--which are generated by the individual mRNAs linked to the probes into a single value for each gene. "The hardest challenge is removing the component of intensity that is due to background noise," says Irizarry. Some background noise--extraneous signals that can be confused with the signals being observed--can be caused by mismatches in the attachment of mRNA to the chip.

Another potential issue, says Irizarry, is that some of the 25-base pair probes used on oligonucleotide chips can be "stickier" than others--that is, more likely to attract mRNA. "If one gene is represented by a sequence that's sticky, it will collect more than a probe that is less sticky," he says. As a result, results from such a probe may reflect the chemistry of cDNA-mRNA bonding more than the biology of the sample.

Once values have been established for each gene in an array, the signals need to be normalized across the array. Equalizing fluorescence signals in two-color arrays is the most basic type of normalization. If equal amounts of two samples were hybridized to an array, then the total fluorescent signal from each sample should also be equivalent. If one is uniformly higher, a statistician can adjust the fluorescence values to better represent the relationship between the samples. Several more-involved processes may also be used to normalize normalize

to convert a set of data by, for example, converting them to logarithms or reciprocals so that their previous non-normal distribution is converted to a normal one.
 data.

According to the TRC study, normalization procedures seem to increase the accuracy of microarray data. But there still remain lots of unanswered questions about normalization formulas, as well as the algorithms used to analyze genomics data, says Kerr. "The data look better when we're done with [normalization]," she says. "But we don't know Don't know (DK, DKed)

"Don't know the trade." A Street expression used whenever one party lacks knowledge of a trade or receives conflicting instructions from the other party.
 if we've really made a correction."

After normalization, researchers can take a basic count of changes in gene expression. Churchill calls this "making lists." He explains: "You measure microarray data from normal tissue and from diseased tissue and ... generate lists of genes that are up or down. The problem now is how we take those lists and turn them into biological sense."

Turning Data into Sense

The next step for many teams is to shorten the list. They may focus on genes with the strongest or most closely correlated changes in expression. Often this process is informed by preexisting pre·ex·ist or pre-ex·ist  
v. pre·ex·ist·ed, pre·ex·ist·ing, pre·ex·ists

v.tr.
To exist before (something); precede: Dinosaurs preexisted humans.

v.intr.
 information about gene function. But care must be taken. Setting filters that are too tight can cause an analysis to ignore genes of importance, while setting parameters too broadly can cause false positives.

That is why different groups can have difficulty replicating results on microarray experiments, says Greg Carr, a research fellow working in product safety at Procter and Gamble. If the statistical power--or probability of detecting genes of significance--is set relatively low, say 10%, one lab may pick up on some of the low-power genes and a second lab may pick up on others, but no one lab is likely to detect them all, he says.

Another approach sometimes used with or instead of data filtering is to cluster, or group, data according to similarities in expression patterns. Methods include hierarchical or "Eisen" clustering, which produces a tree-like structure; [kappa]-means clustering, which produces line graphs; and principal components analysis, which produces a three-dimensional array that can be rotated. Clustering methods are usually exploratory and, Speed says, don't provide an answer to a well-defined question; they can only show associations among genes. However, they can provide effective ways to organize data. Scientists then have to infer cause and effect.

Various types of modeling algorithms can aid in this inference process. One of the simplest models, Boolean networks, can capture multivariate gene relationships that can be inferred from measurement data. Ilya Shmulevich, an associate professor at the Institute for Systems Biology The Institute for Systems Biology (ISB) is a non-profit research institution, located in Seattle, Washington, United States. Leroy Hood co-founded the Institute with Alan Aderem and Ruedi Aebersold in 2000. , in Seattle, has worked to increase the flexibility of Boolean network modeling through the development of probabilistic (probability) probabilistic - Relating to, or governed by, probability. The behaviour of a probabilistic system cannot be predicted exactly but the probability of certain behaviours is known. Such systems may be simulated using pseudorandom numbers.  Boolean networks. These networks allow for multiple functional possibilities for each gene, mimicking underlying biological and measurement uncertainty, says Shmulevich. He and his colleagues have applied probabilistic Boolean network analysis to gene expression data from studies of melanomas and gliomas.

When using most modeling methods, "you have to put in a rather limited set of genes and then learn something about that set," says Speed. The art of assigning genes to modeling programs and deciding how to filter the results of other genomics data draws heavily on preexisting knowledge about the systems in general. Researchers comb through the scientific literature or databases of genetics, proteomics, and metabolomics data. However, database mining can only take researchers so far. Only 60-65% of human genes have been adequately annotated as to function, says Raymond Tennant, director of the NCT. This makes it difficult to infer the function of genes that haven't yet been annotated.

The Chemical Effects in Biological Systems Knowledgebase, set to become publicly available in late 2005, will provide access to microarray data for about 140 reference compounds, and comprehensive data sets on about 10 hepatotoxicants, including acetaminophen. "The database will also include reference information on the biological effects of chemicals and other agents, and pathways related to their mechanism of action," says Michael Waters, NCT assistant director for database development.

Speaking the Language of Inference

Beyond having data in hand, biological inference requires a wide range of skills and expertise. "There's a need for transdisciplinary efforts--notice I said transdisciplinary rather than interdisciplinary," says Kenneth gamos, chair of the Department of Biochemistry and Molecular Biology molecular biology, scientific study of the molecular basis of life processes, including cellular respiration, excretion, and reproduction. The term molecular biology was coined in 1938 by Warren Weaver, then director of the natural sciences program at the Rockefeller  at the University of Louisville See also
  • The University of Louisville Cardinal Singers
  • The University of Louisville Collegiate Chorale
  • History of Louisville, Kentucky
  • McConnell Center
References

1. ^ [1]
2. ^ [2] URL accessed on June 8 2006
3.
 and EHP's toxicogenomics editor. "We need people who speak more than one language, a new type of scientist. We've become so specialized that it is difficult to cross disciplines with fluidity. I think that research in biological inference will demand that ability. Classically trained biologists will need to reengage their appreciation and understanding of mathematics in order to begin to tackle some of these questions."

Biologists may also need to have a better understanding of, and possibly training in, statistics. "Taking Stats 101 isn't necessarily going to imbue im·bue  
tr.v. im·bued, im·bu·ing, im·bues
1. To inspire or influence thoroughly; pervade: work imbued with the revolutionary spirit. See Synonyms at charge.

2.
 the concepts you need," says Churchill. "Statistics isn't about the formulas, how to crunch numbers. It's about the concepts. It's about how to quantify uncertainty, about how to take data and turn it into knowledge."

"It would be beautiful if you had that [statistical] training when you start in the field," says Heinloth. "But if you include enough statisticians and bioinformaticists on your team as equal members, you can have that covered." In Heinloth's research group, statisticians are involved in experiments from the beginning. "There's hardly any decision in a study that's made by only one person," she says. However, she notes that this collaborative process is not "science by committee." As experiments are designed and implemented, team members weigh in only in their areas of expertise.

As researchers delve into this enormous quantity of data, they are confronted with the limits of human cognitive ability. It is not possible for a single individual to fully comprehend the astonishing a·ston·ish  
tr.v. as·ton·ished, as·ton·ish·ing, as·ton·ish·es
To fill with sudden wonder or amazement. See Synonyms at surprise.
 complexity of metabolism in even a single cell type. So as researchers develop new technologies and new statistical tools for genomics research, and for inferring the ramifications ramifications nplAuswirkungen pl  of the data they uncover, they also need to find new ways to approach the limits of their own understanding.

"We need to simplify to understand complexity, to start off with the exploration and understanding of simple systems. If you try to look at the complexity first off, you'll never really unravel it," says Ramos. There's a lot of potential for error in starting with simple systems, he adds. But it's a way to start, to gradually build up to more complex models.

"As we accumulate more data, we're understanding how limited our understanding is and how much more there is to discover," says Churchill. "That can be discouraging from a diagnostic sense, but it's wonderful to have a new universe opening up in front of you."
COPYRIGHT 2005 National Institute of Environmental Health Sciences
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2005, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Environews / Focus
Author:Freeman, Kris
Publication:Environmental Health Perspectives
Date:Jun 1, 2005
Words:3515
Previous Article:Arsenic: in search of an antidote to a global poison.(Environews / Focus)(Cover Story)
Next Article:By order of the court: environmental cleanup in India.(Environews / Spheres of Influence)



Related Articles
Systems toxicology and the chemical effects in biological systems (CEBS) knowledge base.
Taking stock of toxicogenomics: mini-monograph offers overview.(Science Selections)
Database development in toxicogenomics: issues and efforts.(Genomics and Risk Assessment: Mini-Monograph)
An Introduction to Toxicogenomics.
Drawing comparisons at Duke.(NCT Update)
Toxicogenomics through the Eyes of Informatics: conference overview and recommendations.(Meeting Report)
Ushering in the new toxicology: toxicogenomics and the public interest.(Commentary)
A vision that challenges dogma gives rise to a new era in the environmental health sciences.(Essay on: Toxicogenomics)
Meeting report: Validation of Toxicogenomics-Based Test Systems: ECVAM-ICCVAM/NICEATM considerations for regulatory use.(Research)
TRC: mission accomplished.(NIEHS News)

Terms of use | Copyright © 2010 Farlex, Inc. | Feedback | For webmasters | Submit articles