Systems biology: the picture.Genomics, proteomics, and metabolomics have all vastly advanced our understanding of human biology Human biology is an interdisciplinary academic field of biology, biological anthropology, and medicine which focuses on humans; it is closely related to primate biology, and a number of other fields. and disease. But the functioning of even a simple system such as a single yeast cell or bacterium is much more complicated than the sum of its genes or proteins or metabolites Metabolites Substances produced by metabolism or by a metabolic process. Mentioned in: Interactions ; it's the activity of all those components and their relationships to one another that add up to a living organism. Recognizing that complexity, the emerging field of systems biology Systems biology, a field of study in the biosciences, focuses on the systematic study of complex interactions in biological systems. Particularly from 2000 onwards, the term is used widely in the biosciences, and in a variety of contexts. attempts to harness the power of mathematics, engineering, and computer science to analyze and integrate data from all the "omics" and ultimately create working models of entire biological systems. "Traditionally, scientists--toxicologists included--have relied on a reductionist re·duc·tion·ism n. An attempt or tendency to explain a complex set of facts, entities, phenomena, or structures by another, simpler set: "For the last 400 years science has advanced by reductionism ... approach to biology," says William Suk SUK Sveriges Unga Katoliker (Swedens Young Catholics) , director of the NIEHS NIEHS National Institute of Environmental Health Sciences (NIH, DHHS) Center for Risk and Integrated Sciences. Even now, many studies examine complex systems by looking at cellular components in isolation. For instance, a common experiment involves using DNA microarrays to observe the effect of a chemical exposure on thousands of genes at once. This technique can quickly tell a scientist which genes may be vulnerable to that exposure. But a systems biology approach would attempt to model not only the chemical's effect on gene expression but also how that expression will affect protein function, and in turn how the exposure will affect cell signaling Cell signaling is part of a complex system of communication that governs basic cellular activities and coordinates cell actions. The ability of cells to perceive and correctly respond to their microenvironment is the basis of development, tissue repair, and immunity as well as . "There's nothing wrong with what we've been doing," Suk says. "But systems biology is going to take it to another level." Building a New Science From one perspective, systems biology is nothing new. At the turn of the twentieth century, physiologists such as Walter B. Cannon were developing the concept of homeostasis--the self-regulatory mechanisms, hunger and thirst Hunger and Thirst (French original title La Soif et la faim) is one of the last plays by Eugène Ionesco. It was first published in French in 1966. The play has one act divided into four periods. for example, that a living organism uses to keep its internal systems in balance despite an ever-changing external environment. The term "systems biology" was first used in the 1960s, when theoretical biologists began creating computer-run mathematical models of biological systems. But the field took a leap forward beginning in the 1990s, when the high-throughput tools developed for the sequencing of the human genome The human genome is the genome of Homo sapiens, which is composed of 24 distinct pairs of chromosomes (22 autosomal + X + Y) with a total of approximately 3 billion DNA base pairs containing an estimated 20,000–25,000 genes. brought experimental scientists up to the speed of theoretical biologists. The widespread use of the Internet has also made possible for the first time the international collaborations and sharing of huge amounts of data that systems biology requires. "The way that computer science has responded to genomics is one of the great stories of the sociology of twentieth-century science," says Charles DeLisi ![]() Charles DeLisi is the Metcalf Professor of Science and Engineering at Boston University, and also served as Dean of the College of Engineering from 1990 to 2000. , senior associate provost for bioscience and chair of the Bioinformatics Program at Boston University Boston University, at Boston, Mass.; coeducational; founded 1839, chartered 1869, first baccalaureate granted 1871. It is composed of 16 schools and colleges. . Computer scientists have taken a great interest in biology and have stepped up to collaborate with biologists to develop the tools needed to sequence genomes and analyze the resulting data. Leroy Hood Leroy Hood is an American biologist. He won the 2003 Lemelson-MIT Prize for inventing "four instruments that have unlocked much of the mystery of human biology" by helping decode the genome. , a biochemist who is president and cofounder co·found tr.v. co·found·ed, co·found·ing, co·founds To establish or found in concert with another or others. co·found of the nonprofit Institute for Systems Biology The Institute for Systems Biology (ISB) is a non-profit research institution, located in Seattle, Washington, United States. Leroy Hood co-founded the Institute with Alan Aderem and Ruedi Aebersold in 2000. , agrees. "What uniquely defines the systems biology that I'm thinking about has really come from the genome project genome project 1 The Human Genome Project, see there 2. A general term for a coordinated research initiative for mapping and sequencing the genome of any organism and its delineation of a complete parts list of all the genes," he says. "If you know all the genes, you have the ability to do DNA DNA: see nucleic acid. DNA or deoxyribonucleic acid One of two types of nucleic acid (the other is RNA); a complex organic compound found in all living cells and many viruses. It is the chemical substance of genes. arrays, follow the behavior of all the messenger RNAs, and even the proteins, in principle." Measuring gene expression is one important component of systems biology, and methods for doing so are fairly well developed for the needs of this field. However, proteomics--the science of analyzing all the proteins present in a system at any one time--still has some maturing to do before scientists can integrate its data payload into a true systems approach. Proteomics has been hailed as having even more potential than genomics, because whereas DNA is a set of static instructions for an organism, proteins--the machines that actually carry out the work--are a more fluid medium and may reflect the effects of chemical exposure more accurately. But to integrate protein expression into systems biology, scientists need to better understand its relationship to gene expression. DeLisi says scientists still don't understand why in many cases there is a tight correlation between gene and protein expression, while in others (as with transcription factors) the correlation is very loose. "That [understanding] will develop over the next five to ten years," he predicts. Other researchers have pointed to the need for more quantitative techniques to not only detect the presence of proteins but also determine their size, purity, and concentration in a system. Hood agrees that to achieve truly global analyses of complex biological systems, proteomics technology needs more development. "The big problem that proteomics has is that proteins that are expressed at very low levels are generally invisible to the analytic techniques we have," he says. Hood suggests the answer to that problem lies in highly miniaturized sensors and detectors developed through nanotechnology and microfluidics. Another "omics" that promises to fill in gaps in systems biology models is metabolomics--the evaluation of tissues and biofluids such as urine, blood plasma blood plasma n. The yellow or gray-yellow, protein-containing fluid portion of blood in which the blood cells and platelets are normally suspended. , and saliva for metabolite metabolite, organic compound that is a starting material in, an intermediate in, or an end product of metabolism. Starting materials are substances, usually small and of simple structure, absorbed by the organism as food. changes that may result from environmental exposures or from disease. Because metabolites (which include carbohydrates, amino acids, and lipids) are the actual by-products of processing food into energy, this "omics" has the potential to paint a picture of what has actually happened in the cell. Work in metabolomics seeks to go beyond sampling single metabolites to developing profiles of four or five related metabolites. But to create meaningful metabolite profiles, scientists need better tools for measuring tiny amounts of metabolites and for determining which are most important in the activity of the cell. Existing analytical tools such as nuclear magnetic resonance nuclear magnetic resonance: see magnetic resonance. nuclear magnetic resonance (NMR) Selective absorption of very high-frequency radio waves by certain atomic nuclei subjected to a strong stationary magnetic field. spectrometry and mass spectrometry mass spectrometry or mass spectroscopy Analytic technique by which chemical substances are identified by sorting gaseous ions by mass using electric and magnetic fields. go only so far. Indeed, the Metabolomics Technology Development initiative of the NIH "Not invented here." See digispeak. NIH - The United States National Institutes of Health. Roadmap for Medical Research encourages projects that develop new methods for measuring metabolites that are present only in low concentrations or at specific subcellular sub·cel·lu·lar adj. 1. Situated or occurring within a cell: subcellular organelles. 2. Smaller in size than ordinary cells: subcellular organisms. 3. locations. The Measure Is in the Models For those working on a systems biology approach, the goal of developing these various "omics" technologies is to combine their data into interactive models. Hood says, "The ultimate object of systems biology is to understand how the elements and their interactions together give rise to the emergent properties of the system." To do that, scientists begin by modeling individual components such as protein networks and signal transduction Signal transduction The transmission of molecular signals from a cell's exterior to its interior. Molecular signals are transmitted between cells by the secretion of hormones and other chemical factors, which are then picked up by different cells. pathways. Initially, says Hood, the models are descriptive. They may involve perhaps a relatively simple equation showing relationships between a few proteins in a cell. As more information comes to light, the models become more graphical. A graphical model In probability theory, statistics, and machine learning, a graphical model (GM) is a graph that represents independencies among random variables by a graph in which each node is a random variable, and the missing edges between the nodes represent conditional independencies. may visualize a cell as a very complicated flow chart, as a series of pinwheels, or as a spiderweb (tool) Spiderweb - A program for creating versions of Knuth's WEB self-documenting programs ("literate programming"). ftp://princeton.edu/. . Relationships between elements are depicted through color or distance. Next, researchers can experiment with an actual system such as yeast to see what will happen at the organism level when one component of the system is perturbed per·turb tr.v. per·turbed, per·turb·ing, per·turbs 1. To disturb greatly; make uneasy or anxious. 2. To throw into great confusion. 3. . "You can do genetic perturbations where you knock out genes, for example, or environmental perturbations where you give or take away certain kind of sugars," Hood says. "And then you observe how all the other elements behave in response to those perturbations." Those experiments will usually yield data that aren't explained by the model. "So you formulate hypotheses to explain the discrepancies, and you go back and do more of these global and integrated experiments," Hood says. It's a long way from modeling yeast to modeling a human being. But Hood believes that the knowledge gained from a model of a simple system can be scaled up. Such comparative genomics--the ability to learn about complex systems by modeling simpler systems that have similar genetics--is one of the most powerful tools in systems biology, Hood says. "The basic idea is that once evolution latches on to a good idea, it generally uses it over and over again," he says. "But when you do these comparisons you have to be very aware that although the very basic strategy may be similar, there might be many elaborations." So scientists must look for "control elements," small elements such as binding sites for transcription factors that are conserved in both organisms. The next step in modeling is to describe relationships mathematically. The Institute for Systems Biology has created a mathematical model of the relatively simple metabolic process Noun 1. metabolic process - the organic processes (in a cell or organism) that are necessary for life metabolism organism, being - a living thing that has (or can develop) the ability to act or function independently by which yeast cells get energy by breaking down the sugar galactose. After defining all the genes in the yeast genome and the particular genes, proteins, and other small molecules that are known to play a role in the galactose metabolism pathway, the researchers created a model. Then they grew, in the presence of galactose and without, normal yeast as well as strains that had particular genes deleted. They analyzed the reactions using microarrays, quantitative proteomics, and databases of known physical interactions. "We can write down a series of differential equations, and we can choose parameters to put into those differential equations and can predict a lot of the behavior of the system," Hood says. Some scientists believe that current bioinformatics capabilities can handle many biological modeling tasks if a complex system is portrayed as a series of smaller models that overlap one another. But the experimental design must integrate all the "omics" from the beginning. "The phrase 'data integration' has been closely associated with buying big mainframes and having engineers design big databases," says Eric Neumann, global head of knowledge management at Aventis Pharmaceuticals. All of this computational power is aimed at trying to combine information from experiments that don't have an integrated design--for instance, trying to relate gene expression data from one study to protein data collected in a separate study. Neumann says this integration can be done better for one-tenth of the cost by adopting good experimental design and focusing more on the downstream. Neumann's ideal experiment involves collecting gene microarray data, protein levels, metabolite levels, clinical phenotypes, and serum biomarkers in one experiment. "If one compares across experiments, then unless everything is kept constant, the data may not be statistically equivalent," he says. "Some high-level comparisons may be made if one accepts the independent interpretations from both studies, but these will often be more qualitative mergings than quantitative in nature." The integrated experiment design that Neumann favors would allow statistically valid merging, he says. That merging can be accomplished with several simple databases that all point to the same experimental design. The end result is a relational database that "looks like a big spiderweb," Neumann says. John Doyle, a professor of control and dynamical systems Dynamical Systems A system of equations where the output of one equation is part of the input for another. A simple version of a dynamical system is linear simultaneous equations. Non-linear simultaneous equations are nonlinear dynamical systems. at the California Institute of Technology California Institute of Technology, at Pasadena, Calif.; originally for men, became coeducational in 1970; founded 1891 as Throop Polytechnic Institute; called Throop College of Technology, 1913–20. , has likened the complexity of a biological system to that of a Boeing 777 jet. Each system needs only a small portion of its control systems for basic functioning (for instance, Escherichia coli Escherichia coli (ĕsh'ərĭk`ēə kō`lī), common bacterium that normally inhabits the intestinal tracts of humans and animals, but can cause infection in other parts of the body, especially the urinary tract. can survive in the laboratory even when 90% of its genes are knocked out). The jet includes more than 100,000 components such as computers and sensors, most of which are not needed under ideal conditions, but which enable the plane to stabilize if conditions suddenly change. Likewise, biological organisms include complex control systems that kick in only during potential threats--such as variations in temperature or nutrients--to keep the organism stable. In general, this complexity makes the organism robust. But some scientists hypothesize hy·poth·e·size v. hy·poth·e·sized, hy·poth·e·siz·ing, hy·poth·e·siz·es v.tr. To assert as a hypothesis. v.intr. To form a hypothesis. that such complexity can leave a system vulnerable to unplanned disruptions such as genetic mutation. The mutation may be tiny, but because the gene is involved in such a complex, multilayered control network, the tiny mutation can trigger a "cascading failure"--a kind of domino effect that leads to a major threat such as cancer or autoimmune disease autoimmune disease, any of a number of abnormal conditions caused when the body produces antibodies to its own substances. In rheumatoid arthritis, a group of antibody molecules called collectively RF, or rheumatoid factor, is complexed to the individual's own gamma . Mathematicians and engineers are at work on algorithms and other tools to better model this robust yet fragile nature and other aspects of complex biological systems. High Hopes for Tiny Tools Some researchers say that developments in nanotechnology and microfluidics may revolutionize systems biology. Nanotechnology involves manipulating molecules smaller than 100 nanometers--the scale of viruses. Microfluidics, which is commonly used in ink-jet printing, uses pumps and valves to transport nanoliter volumes of fluids through microchannnels in a tiny glass or plastic chip. Hood says that nanotechnology and microfluidics will eventually enable scientists to make many different measurements in parallel and in small amounts of material. "In principle, you can make these measurements down to the single-molecule level," he says. In theory, researchers could create nanobiosensors no bigger than 100 nanometers that could be surgically implanted into the body or injected into the bloodstream to measure biomarkers of environmental exposure or diagnose problems in cell function. Suk says nanobiosensors could potentially measure processes as sensitive as the flux of calcium ions inside a cell. David Walt, a professor of chemistry at Tufts University, is using fiber optic technology to develop tiny sensors that could be used to screen toxicants. Optical fibers are extremely fine strands of glass that can transmit light to and from a sample. Right now the sensors directly measure chemical changes in arrays of yeast or E. coil cells in response to toxicants such as mercury or the chemotherapeutic agent chemotherapeutic agent An agent used to treat CA, administered in 'regimens'-one or more 'cycles' that combine 3 or more agents over wks; CAs are toxic to any cell with a high rate of proliferation–the CA itself, the GI tract–causing N&V, mitomycin C mitomycin, mitomycin C a group of highly toxic antineoplastics (mitomycin A, B and C) produced by Streptomyces caespitosus, indicated for palliative treatment of certain neoplasms that do not respond to surgery, radiation and other drugs. . The cells are fluorescently labeled, and the researchers monitor chemical changes by correlating those changes to changes in fluorescent intensity. "The goal is to be able to measure solutions and determine their toxic potential," Walt says. A similar method could eventually be used to quickly screen potential toxicants, replacing some animal testing. First, though, studies would need to be run to ensure that the cell-based arrays consistently yield the same results as animal testing, Walt says. Suk is excited about the idea that nanobiosensors could enter cells and make direct measurements of their inner workings. "That will allow us to have a better understanding--maybe even a complete understanding for certain types of tissues-of how cells and systems communicate with each other," he says. "If you can understand how a cell works, you can then scale up to tissues, then organs." He predicts that nanobiosensors will make it much easier to measure human exposures in as soon as five years. A Surprising Challenge Once all the measurements are made, what must happen to make the megamodels of systems biology a reality? Surprisingly perhaps, some of the players in the field say that the biggest challenge for systems biology isn't technical--rather, it's a matter of community. One problem is the lack of a common language for systems biology. As with other multidisciplinary collaborations, all the players will need to develop a language they can share. That became apparent at a December 2003 retreat on systems biology that Suk organized for the NIEHS Division of Extramural extramural /ex·tra·mu·ral/ (-mur´il) situated or occurring outside the wall of an organ or structure. extramural situated or occurring outside the wall of an organ or structure. Research and Training, where speakers included an engineer, a biochemist, a computer scientist, and a physician--each of whom approaches the field from his or her own perspective. Neumann agrees the biggest challenges for systems biology are human ones--language and data sharing. "There's a side to it that involves data analysis by people who feel comfortable looking at data, writing programs, some numerics," he says. But that analysis needs interpretation from scientists who know about disease and about environmental exposures. Neumann believes that getting the relevant data to those who can interpret it is the biggest bottleneck for the field. "There are more papers than ever before in science, and most of us can't read them all," he says. Simple text queries such as those used to search literature databases capture words or phrases out of context. But it's possible now, Neumann says, to populate scientific papers with embedded, machine-readable phrases that convey the relevance of the work. Such databases would use ontologies--formal, machine-readable definitions of terms and the relationships between those terms. Programs that know something about logic and relationships can help weed out irrelevant information, Neumann says, and help reveal connections between concepts. In the 2004 Proceedings of the Pacific Symposium on Biocomputing The Pacific Symposium on Biocomputing (PSB) is a scientific meeting held annually since 1996. The conference is unusual in that its sessions are determined annually on the basis of proposals submitted by interested scientists, with the goal of presenting research only on , Daniel McSban of the University of Colorado University of Colorado may refer to:
Detoxification is one of the more widely used treatments and concepts in alternative medicine. It is based on the principle that illnesses can be caused by the accumulation of toxic substances (toxins) in the body. pathways would metabolize me·tab·o·lize v. 1. To subject to metabolism. 2. To produce by metabolism. 3. To undergo change by metabolism. metabolize to subject to or be transformed by metabolism. ethyl ethyl (ĕth`əl), CH3CH2, organic free radical or alkyl group derived from ethane by removing one hydrogen atom. and furfuryl alcohol. The model's prediction correlated with known patterns of alcohol metabolism. Automated data mining tools are well on their way to development now. "Vendors are working on various applications that would support the enhanced linking of documents anywhere, and eighty to ninety percent of that [technology] already exists," Neumann says. [For an example of one such tool, see "Literature Searchlight," EHP EHP abbr. 1. effective horsepower 2. electric horsepower 112:A872 (2004).] The real test is how willing scientists are to use these tools. "We already take enough time footnoting our papers. Imagine if those footnotes were machine-readable," Neumann says. Authors would use software that works like a spellchecker to screen their papers and choose the formal concepts that are most relevant. "So all of a sudden the whole search and review of literature changes overnight," he says. "Right now we all are doing text mining and extraction by ourselves. But in the future it will be done by authors as they submit their papers." Neumann believes that PubMed Central and other efforts to make all federally funded research freely available will provide an opening for data mining to become commonplace. "To survive, the scientific publishers are going to have to ask, 'What's our added value?' If added value can be putting [text] in a smart ontology--bingo. I think we will see quick embracement of this so that [scientific publishers] find a whole new market strategy," he says. DeLisi adds that more emphasis on computational and mathematical training for scientists will help make systems biology more mainstream. "Most of us in this area now have educated ourselves," he says. "People end up learning what they need to learn to solve the problems they're interested in solving." Several training programs in bioinformatics and systems biology exist, such as the one that DeLisi is involved in at Boston University and those at various University of California The University of California has a combined student body of more than 191,000 students, over 1,340,000 living alumni, and a combined systemwide and campus endowment of just over $7.3 billion (8th largest in the United States). campuses. Other programs are under development. 'Tin very optimistic," DeLisi says. "Over the next ten to fifteen years I expect to see a very large shift toward a much more mathematical biology, and certainly toward a highly computationally intensive biology. 1 have no doubt that in the next ten to twenty years TWENTY YEARS. The lapse of twenty years raises a presumption of certain facts, and after such a time, the party against whom the presumption has been raised, will be required to prove a negative to establish his rights. 2. , biology will be the most computational of all the sciences." Optimistic Predictions Suk's biggest hope for systems biology is that it will create more realistic models of complex environmental exposures. "We're exposed to lots of chemicals but at very low concentrations over time," he says. "We need tools to help us understand how complex exposures perturb complex systems." Such tools to help elucidate system interrelationships are forthcoming. For example, in the August 2004 toxicogenomics issue of EHP, Hiroyoshi Toyoshiba and colleagues at the NIEHS Laboratory of Computational Biology and Risk Analysis reported the creation of a statistical software program that quantitatively sorts gene expression data to identify which genes interact in a network. The team has used the program to determine the effect of the carcinogen carcinogen: see cancer. carcinogen Agent that can cause cancer. Exposure to one or more carcinogens, including certain chemicals, radiation, and certain viruses, can initiate cancer under conditions not completely understood. tetrachlorodibenzo-p-dioxin on 11 genes in lung epithelial cells Epithelial cells Cells that form a thin surface coating on the outside of a body structure. Mentioned in: Corneal Transplantation and the genes' subsequent effect on the retinoic acid retinoic acid /ret·i·no·ic ac·id/ (ret?i-no´ik) an oxidized derivative of retinol, believed to be the form of vitamin A that plays a role in the development and growth of bone and in the maintenance of normal epithelial structures. signal transduction pathway. This program looks further than comparing a simple pair of genes. Instead it shows the relationships between a whole set of genes in a network. The study authors have said that when the toot is expanded to include other data such as protein levels, it will help researchers understand biopathways in cells, tissues, and eventually entire organisms. David Schwartz, a professor of medicine and genetics at Duke University, acknowledges the benefit of the systems biology approach but tempers his enthusiasm with a focus on the here and now. "Systems biology may help us understand biological processes, but we have to put them into a context of human disease," he says. Schwartz's lab investigates how gene expression profiles can be used as preclinical markers to help understand the biology and genetics of complex environmentally related diseases such as asthma. "We can also use global 'omic' approaches to identify biologic pathways that are specifically affected by disease-based environmental toxicants," he says. But Suk is optimistic that systems biology will deliver on its promise for environmental health in the near future. "We're only limited by two things--by our ability to [grasp] it and by money," he says. A working model of an entire biological system would possess enormous power for learning how environmental exposures result in disease. And some scientists say that many elements of such a model are within science's grasp. But fully embracing the systems approach will also require scientists to embrace change. They must create a new language for the field. They must design experiments always with the whole system in mind. They must even learn a new way of footnoting their papers. How will all this change happen? Like the intricate webs of systems biology models themselves, the answer is sure to be complex. |
|
||||||||||||||||


Printer friendly
Cite/link
Email
Feedback
Reader Opinion