Proteomics: characterizing the cogs in the machinery of life.Now that human genome The human genome is the genome of Homo sapiens, which is composed of 24 distinct pairs of chromosomes (22 autosomal + X + Y) with a total of approximately 3 billion DNA base pairs containing an estimated 20,000–25,000 genes. sequence is complete, the quest to extract beneficial knowledge from it is on. One of the most promising, active areas of exploration lies in the human proteome--the global expression of proteins, those marvelouus strings of amino acids responsible for all human biologic processes. Proteins are life, and the recently developed ability to study them on a large scale, quantitatively and qualitatively, is known as proteomics.
The proteome pro·te·ome
The complete set of proteins that are produced by the genes of an organism.
the entire complement of proteins produced by a cell. may never be completely solved in the same way the genome was. The genome is relatively static, and presented a finite end point. The proteome is dynamic, changing constantly with time and conditions, with proteins interacting to networks and pathways to respond to stimuli and carry on the endless business of cellular function. The challenge of completely mapping the proteome is widely considered to be several orders of magnitude greater than that of the genome. The picture is so complex and so dynamic that some proteomics experts question the very concept of the existence of a measurable human proteome. Famed genomicist J. Craig Venter The introduction of this article is too short.
To comply with Wikipedia's lead section guidelines, it should be expanded. put this doubt succinctly when he told the 5 April 2001 Wall Street Journal that "there ain't nosuch thing as a proteome."
Nevertheless, there can be no debate that proteomics is poised to deliver vast amounts of useful information about physiologic function at the subcellular sub·cel·lu·lar
1. Situated or occurring within a cell: subcellular organelles.
2. Smaller in size than ordinary cells: subcellular organisms.
3. , cellular, organic, and systemic levels, yielding profound new insights into disease and drug mechanisms, the effects of environmental exposures, and much more. Although a comprehensive map of the entire human proteome may never be accomplished, protein maps of human organs, glands, and fluids and of entire less-complex organisms are within sight, and major efforts are under way to document many of those proteomes.
Evolution of Proteomics
In large measure, proteomics has emerged in parallel fashion with the other "-omics" fields such as transcriptomics and metabonomics. The technologies, methodologies, and grand ambitions of the Human Genome Project have rapidly proliferated and now permeate virtually every area of the life sciences.
Just as the advent of genomics brought the ability to discover large numbers of genes quickly, proteomics was born when technologic advances allowed scientists to widen their focus from the painstaking isolation and identification of single proteins to a more comprehensive view of the entire protein complement expressed in a given cell line, tissue, or organism. However, proteomics researchers employ their own unique mix of tools, approaches, and skills to address the questions they seek to answer.
Although the term "proteomics" did not exist until 1994, when Australian postdoctoral student Marc Wilkins Marc Allen Wilkins (born October 21, 1970 in Mansfield, Ohio) is an American former Major League Baseball player. A pitcher, Wilkins played for the Pittsburgh Pirates from 1996 to 2001. coined it, the practice of the science has been going on since the mid-1970s. Two milestone technologic breakthroughs facilitated the ability to look at multiplicities of proteins, both of which, although much refined, are still in wide use today in laboratories around the world.
At its core, proteomics is all about separation and identification--the process of taking a sample of interest, separating out all of the proteins therein, and then identifying them. The first major breakthrough, which was a great leap forward Great Leap Forward, 1957–60, Chinese economic plan aimed at revitalizing all sectors of the economy. Initiated by Mao Zedong, the plan emphasized decentralized, labor-intensive industrialization, typified by the construction of thousands of backyard steel in separation, took place in 1975 with the introduction of two-dimensional gel electrophoresis Two-dimensional gel electrophoresis, abbreviated as 2-DE or 2-D electrophoresis, is a form of gel electrophoresis commonly used to analyze proteins. Mixtures of proteins are separated by two properties in two dimensions on 2D gels. (2DE).
With this method--still the first step in many proteomics experiments--proteins from a sample are separated on a polyacrylamide gel pol·y·a·cryl·a·mide gel
A hydrated polymer consisting of a long chain of amide groups, used as a medium for substances that undergo gel electrophoresis. according to their mass and charge, which, along with intensity, are what provide the spectrum that makes up a protein's distinctive signature. The more abundant the protein, the larger and more intensely staining the spot on the gel.
The only problem is that 2DE, while it allows separation and visualization of the protein complement, does little or nothing to address identification. Regardless, the advent of 2DE was so exciting that in 1980 it spawned the proposal of a Human Protein Index project--an effort to catalog all human proteins and then use that knowledge to define the genome (although Congress considered the project, it was never funded, and advances in genomics soon bypassed the idea).
The second major breakthrough, which really brought proteomics into its own, was the arrival of two crucial techniques in the 1980s that made possible the use of mass spectrometry mass spectrometry
or mass spectroscopy
Analytic technique by which chemical substances are identified by sorting gaseous ions by mass using electric and magnetic fields. (MS) to identify proteins: matrix-assisted laser desorption/ionization Matrix-assisted laser desorption/ionization (MALDI) is a soft ionization technique used in mass spectrometry, allowing the analysis of biomolecules (biopolymers such as proteins, peptides and sugars) and large organic molecules (such as polymers, dendrimers and other (MALDI MALDI Matrix-Assisted Laser Desorption/Ionization ) and electrospray ionization (ESI (Edge Side Includes) A markup language for Web pages that enables elements of a Web page to be dynamically assembled in servers distributed throughout the Internet. ). These methods allow protein samples to be ionized i·on·ize
tr. & intr.v. i·on·ized, i·on·iz·ing, i·on·iz·es
To convert or be converted totally or partially into ions.
i for analysis in a mass spectrometer, producing a pattern called a mass spectrum. These mass spectra--which often number in the thousands for a given sample--can then be used to positively identify proteins or protein digests (strings of peptides or protein fragments produced when the proteins are ionized) through the automatic querying of protein databases. Unidentified or novel proteins can be analyzed through further MS runs or by other techniques.
"ESI and MALIDI were a quantum leap," says William Pierce, a professor of pharmacology, toxicology, and chemistry at the University of Louisville See also
1. ^ 
2. ^  URL accessed on June 8 2006
3. School of Medicine. The subsequent development of time-of-flight (TOF (Top Of Form) The beginning of a physical paper form. To position paper in many printers, the printer is turned offline, the forms are aligned properly and the TOF button is pressed. ) detection, which expanded the range of ionic molecular weights detectable by MS instruments, brought further analytic capabilities to MALDI. Today, MALDI-TOF MALDI-TOF Matrix Assisted Laser Desorption Ionization - Time of Flight is in widespread use in proteomics laboratories.
A dazzling panoply pan·o·ply
n. pl. pan·o·plies
1. A splendid or striking array: a panoply of colorful flags. See Synonyms at display.
2. of MS refinements and enhancements, as well as the development of other technologies, now facilitate the application of proteomic techniques to a highly diversified universe of research pursuits. Virtually every proteomics laboratory, whether it's connected with government, academia, or industry, seems to have its own favored technologic approach, and many have developed their own in-house methods, along with customized bioinformatics software, to help sort through and make sense of the massive amounts of data their systems generate.
"I think we need more than one platform to be able to adequately do service to measuring proteins in a global fashion," says B. Alex Merrick, head of the Proteomics Group at the NIEHS NIEHS National Institute of Environmental Health Sciences (NIH, DHHS) National Center for Toxicogenomics (NCT NCT National Childbirth Trust
NCT National Car Test
NCT North Carolina Theatre
NCT National Coordination Team
NCT Northern California TRACON
NCT Noise Cancellation Technology
NCT Network Control and Timing
NCT Nicotine Replacement Therapy ). "The proteome is constantly changing. Technologically, you're always trying to hit a moving target." Merrick cautions, however, that "because proteins have so many properties, or attributes, and we have the potential to measure them, it spawns many, many platforms, and technologically we haven't sorted out which platforms are the best ones."
The general feeling among proteomics researchers is that the field is on the verge On the Verge (or The Geography of Yearning) is a play written by Eric Overmyer. It makes extensive use of esoteric language and pop culture references from the late nineteenth century to 1955. of consolidating years of method development into a flood of knowledge. "I think we're about to transition out of the age of explorers in proteomics to the age of applications," says Daniel Liebler, a professor of biochemistry and director of proteomics at the Vanderbilt University School of Medicine. "Proteomics techniques are not going to be done just as demonstrations of powerful technology, but will really be integrated into studies in basic laboratory science, animal and nonanimal models of diseases and environmental exposures, and actually in human clinical studies as well."
Proteomics in Action: Cancer Detection
Thanks to progress in clinical proteomics, someday soon a simple blood test could hold the key to early diagnosis of certain cancers. That is one of the many goals of the Clinical Proteomics Program, a joint research effort that is codirected by biochemist Emanuel Petricoin of the U.S. Food and Drug Administration (FDA FDA
Food and Drug Administration
n.pr See Food and Drug Administration.
n.pr the abbreviation for the Food and Drug Administration. ) and pathologist Lance Liotta of the National Cancer Institute (NCI See Liberate. ).
The group has developed a method of identifying protein patterns in blood serum Blood serum
A component of blood.
Mentioned in: Bites and Stings
the residual fluid of blood after clotting has occurred. It is plasma after the fibrinogen has been removed. that is potentially indicative of the presence of a wide range of diseases. Their initial study, published 16 February 2002 in The Lancet, focused on ovarian cancer ovarian cancer
Malignant tumour of the ovaries. Risk factors include early age of first menstruation (before age 12), late onset of menopause (after age 52), absence of pregnancy, presence of specific genetic mutations, use of fertility drugs, and personal history of breast , which presently has both a poor late-stage survival rate and a poor early detection rate--a deadly combination fostering an urgent need for better diagnostic tests, especially for women at high risk for developing the disease.
The investigators used surface-enhanced laser desorption/ionization Surface-enhanced laser desorption/ionization (SELDI) is an ionization method in mass spectrometry that is used for the analysis of protein mixtures. SELDI is typically used with time-of-flight mass spectrometers and is used to detect proteins in tissue samples, blood, TOF (SELDI-TOF SELDI-TOF Surface-Enhanced Laser Desorption/Ionization Time-Of-Flight ), a variation on MALDITOF MALDITOF Matrix-Assisted Laser Desorption/Ionization Time-Of-Flight (mass spectroscopy) that incorporates protein microarrays and is particularly well suited to detecting patterns of proteins in samples. First, a "training" set of known, unblinded samples was run through the instrument--in this case, serum samples from both healthy women and women with ovarian cancer. With the high throughput of the equipment, spectra for each sample--each one containing 15,200 data points, or individual pieces of information--were quickly generated.
Next, the raw data were processed by a unique bioinformatics system that incorporates a form of artificial intelligence called a genetic algorithm. The genetic algorithm compares the patterns of protein expression in the diseased samples to those in the healthy samples, looking for Looking for
In the context of general equities, this describing a buy interest in which a dealer is asked to offer stock, often involving a capital commitment. Antithesis of in touch with. those patterns that optimally discriminate between the two. The algorithm learns as it goes in a process that involves hundreds of millions of pattern combinations and comparisons. The end product is a pattern of unidentified proteins--in this case, five--that precisely distinguishes healthy samples from diseased ones.
The next step was to run a set of known but blinded samples through the same process, and then compare the results to assess the predictive power of the patterns. In this study, the investigators achieved a sensitivity of 100%--that is, all of the cancerous samples were correctly identified, with no false negatives--and a specificity of 95%, meaning only 5% of the identifications were false positive. This was vastly superior to the 35% positive predictive value Positive predictive value (PPV)
The probability that a person with a positive test result has, or will get, the disease.
Mentioned in: Genetic Testing
positive predictive value in the same samples of cancer antigen 125, the present gold standard clinical biomarker.
Subsequent technical refinements (which included a switch to a much higher-resolution and more stable mass spectrometer, and incorporation of advanced spectral quality control methods) have improved the system's sensitivity and specificity to 100% in a larger blinded set of ovarian cancer and high-risk samples. The team is currently enrolling participants in a clinical trial to test their methodology in detecting recurrence of ovarian cancer.
Although the proteins in the discriminatory pattern generated by this method are at least initially unidentified, Petricoin says that is beside the point. "We as scientists want to understand what the nature of the beast Nature of the Beast is the ninth episode of The WB television series Birds of Prey. The episode aired on December 18, 2003. Summary
When Al Hawke, her mother's killer, is hunted by The Specialist - a metahuman assassin with the ability to pass through solid is, and we're hunting that down," he says. "We're already making great progress to that end. But we don't see identity as necessary for its use in diagnostics."
Two of the world's largest reference laboratory companies apparently agree. Quest Diagnostics and LabCorp have sublicensed the technology from Correlogic Systems (which developed the initial genetics algorithm and licensed the technology from the U.S. government), and plan to start offering the proteomic test as an ovarian cancer screening tool for women at high risk by the end of 2003. Initially, they will market the procedure under the FDA's "home brew Products that are developed at home by hobbyists. " provision, which allows the companies to perform the service only in their own validated laboratories.
Petricoin is optimistic that proteomic pattern diagnostics could impact medical diagnostics in a big way. "This is a different type of diagnostic paradigm," he says. "[It] completely changes and turns on its head the normal tried-and-true route--which we would suggest is failing--of looking at discovery biomarkers.... I think if either our or LabCorp/Quest's efforts are successful, it's really going to throw a gauntlet down on a completely different type of diagnostic procedure being used in the clinic. That could have reverberations throughout disease detection, period."
The FDA/NCI group is applying this proteomic technique in similar studies of breast, lung, pancreatic, esophageal, brain, and prostate cancers, as well as efforts to detect cancer drug cardiovascular toxicity before symptoms occur, and to assess the effectiveness of molecularly targeted cancer drugs.
Another Approach: Cancer Profiling
The Mass Spectrometry Research Center at the Vanderbilt University School of Medicine, headed by Richard Caprioli, the Stanley Cohen Professor of Biochemistry, also is pursuing methods of distinguishing diseased tissue from healthy tissue, particularly in cancer. But Caprioli's group takes a very different approach, in which identification of the proteins in the affected tissues themselves, rather than in plasma, is central. It's called tissue proteome profiling, and it appears to be a powerful new tool for both diagnosis and prognosis.
Tissue proteome profiling has several advantages, including the ability to accurately detect factors such as life expectancy Life Expectancy
1. The age until which a person is expected to live.
2. The remaining number of years an individual is expected to live, based on IRS issued life expectancy tables. and tumor aggressiveness. Tissue proteome profiling could also directly identify potential targets for drug intervention, as well as contribute to understanding of mechanisms of the disease.
In their most complete study to date, published 9 August 2003 in The Lancet, Caprioli and his coworkers concentrated on lung tumors. They took hundreds of lung tumor biopsy samples and analyzed their protein complements via MALDI MS, looking at several spots on each sample, each of which generated thousands of signals in a specific pattern of proteins. Then, using a series of bioinformatics tools, they correlated the tissue proteome profiling information with known information about the individual patients, some of whom had already died of their disease.
"We found that at the first level, we could find unique sweeps of proteins that helped us actually classify the disease," says Caprioli. "So if you take the biggest set of lung tumors, non-small cell lung carcinomas, we could further classify them as adenocarcinomas, squamous cell carcinomas squamous cell carcinoma
A carcinoma that arises from squamous epithelium and is the most common form of skin cancer. Also called cancroid, epidermoid carcinoma. , and so on."
Of course, pathologists can do the same, but Caprioli says they didn't stop there. "We asked, 'Can we correlate these patterns with the life expectancy or the prognostic value of these diseases?' And it turned out to be of very high accuracy." He says the researchers could tell from the protein pattern which patients would go on to survive for long periods of time, and which patients would die of cancer--"so the aggressiveness of the disease was apparent in the protein profile."
They were further able to pick out with approximately 80% accuracy those patients whose tumors had metastasized, causing nodal Having to do with nodes. See node.
NODAL - Interpreted language implemented on Norsk Data's NORD-10 computers. Used by CERN and DESY high energy physics labs to control their accelerator hardware, PADAC and SEDAC. Included trackball input, graphics. involvement and the often inoperable inoperable /in·op·er·a·ble/ (in-op´er-ah-b'l) not susceptible to treatment by surgery.
Unsuitable for a surgical procedure. development of secondary tumors. There is presently no other method of making such a crucial clinical prediction.
Although appropriately cautious to point out that these results were in just one type of tumor study, Caprioli is excited about the possibility of using protein patterns to identify types of tumors that have an aggressive posture for nodal involvement. "It begins to get you out of just diagnosing to actual patient care, so that the clinician can now identify a high-risk group high-risk group Epidemiology A group of people in the community with a higher-than-expected risk for developing a particular disease, which may be defined on a measurable parameter–eg, an inherited genetic defect, physical attribute, lifestyle, habit, and make the appropriate therapeutic decisions," he says.
Caprioli's group is nearing completion of a similar study of brain tumors with similar results in terms of the power of tissue protein profiling for prognostication. They're also looking at diabetes mellitus diabetes mellitus
Disorder of insufficient production of or reduced sensitivity to insulin. Insulin, synthesized in the islets of Langerhans (see Langerhans, islets of), is necessary to metabolize glucose. In diabetes, blood sugar levels increase (hyperglycemia). , cardiac and pulmonary diseases, and several other conditions. Caprioli asserts that this platform, along with other clinical proteomics work, ultimately constitutes an entry point into the field of individualized in·di·vid·u·al·ize
tr.v. in·di·vid·u·al·ized, in·di·vid·u·al·iz·ing, in·di·vid·u·al·iz·es
1. To give individuality to.
2. To consider or treat individually; particularize.
3. medicine, centered around the concept that each patient's disease is unique at the molecular level. "It's a whole new way of looking at things," he says. "There's no doubt in my mind that as we collectively learn more and more about the molecular ways of diagnosing disease, of predicting disease progression, that this individualized way of looking at diseases will become more and more common."
Toxicoproteomics: Mechanisms and Biomarkers
Liebler's main focus is toxicoproteomics. His group concentrates on understanding how reactive intermediates produce deleterious effects by modifying proteins. These unstable chemical species enter a cell as a result of environmental exposures and tend to bind to to contract; as, to bind one's self to a wife s>.
See also: Bind proteins or DNA DNA: see nucleic acid.
or deoxyribonucleic acid
One of two types of nucleic acid (the other is RNA); a complex organic compound found in all living cells and many viruses. It is the chemical substance of genes. , modifying their properties in an injurious in·ju·ri·ous
1. Causing or tending to cause injury; harmful: eating habits that are injurious to one's health.
2. way and forming new biomolecules This page aims to list articles on Wikipedia that describe particular biomolecules or types of biomolecules.
This list is not necessarily complete or up to date - if you see an article that should be here but isn't (or one that shouldn't be here but is), please update the page known as adducts.
Using a form of MS called tandem MS, or MSIMS, along with a novel algorithm and proprietary bioinformatics software called Scoring Algorithm for Spectral Analysis, Liebler and his group are able to analyze the mass spectra of peptides to establish their sequences, the positions of any modifications, and, by mapping that information back onto the entire protein sequence, the sites of modification in the protein itself. Ultimately, two major questions are addressed: What are the protein targets of reactive intermediates? And what are the cellular responses to protein modification?
Answers to these questions will shed light on some of the most important avenues of contemporary research in toxicology. Liebler sees the biggest near-term payoff of this type of work as coming in two general areas. One area is the understanding of mechanisms of toxicity. The other is the identification of biomarkers of exposure. "If we can figure out what the targets of some of these environmental compounds are or what reactive intermediates come from environmental stimuli or stresses, by understanding mechanisms we then know what components of the cell or tissues might be amenable to some kind of protective intervention," he says.
As proof of principle, Liebler's team published a study in the June 2002 issue of Chemical Research in Toxicology documenting their system's ability to map hemoglobin adducts of the aliphatic aliphatic /al·i·phat·ic/ (al?i-fat´ik) pertaining to any member of one of the two major groups of organic compounds, those with a straight or branched chain structure.
adj. epoxides, a group of common industrial chemicals. The team is most interested in investigating biomarkers of oxidative stress oxidative stress,
n an imbalance of the prooxidant antioxidant ratio in which too few antioxidants are produced or ingested or too many oxidizing agents are produced. , the damaging phenomenon implicated im·pli·cate
tr.v. im·pli·cat·ed, im·pli·cat·ing, im·pli·cates
1. To involve or connect intimately or incriminatingly: evidence that implicates others in the plot.
2. in many disease processes and often the result of environmental exposure. "What we would like to do is identify some of the most abundant of these reactive intermediates that are formed tinder representative conditions either in vitro in vitro /in vi·tro/ (in ve´tro) [L.] within a glass; observable in a test tube; in an artificial environment.
In an artificial environment outside a living organism. or in animal models in vivo in vivo /in vi·vo/ (ve´vo) [L.] within the living body.
Within a living organism.
in vivo adv. , where we can manipulate oxidative stress," says Liebler.
Pierce's biomolecular MS lab at Louisville is involved in similar functional proteomics work. His group looks at subsets, or small clusters, of functionally interactive proteins. They isolate post-translational modifications, any of more than 100 different types of changes that can be made to proteins by a variety of factors after their original creation. (This partially accounts for the vastly larger number of proteins than genes.) Pierce's group also works to develop or validate biomarkers in cases of specific environmental or xenobiotic xen·o·bi·ot·ic
Foreign to the body or to living organisms. Used of chemical compounds.
A xenobiotic chemical.
any substance, harmful or not, that is foreign to the animal's biological system. exposures and those agents' interactions with nucleic acids Nucleic acids
The cellular molecules DNA and RNA that act as coded instructions for the production of proteins and are copied for transmission of inherited traits. and proteins.
In a collaboration with Louisville professor of medicine Aruni Bhatnagar, Pierce and colleagues are looking at a very large, ubiquitous set of chemicals, the aldehydes, which form adducts with proteins, potentially contributing to cardiovascular disease Cardiovascular disease
Disease that affects the heart and blood vessels.
Mentioned in: Lipoproteins Test
cardiovascular disease . Aldehydes are not just environmental contaminants, but are also naturally present in food and are intermediates in human metabolism. By identifying aldehyde-induced adducts and elucidating how they might influence protein function, the team hopes to characterize a novel mechanism involved in hypertension, stroke, and other forms of cardiovascular disease.
Pierce is also working on a project with university associate professor of medicine James Summersgill, investigating the interactions of the microorganism microorganism /mi·cro·or·gan·ism/ (-or´gah-nizm) a microscopic organism; those of medical interest include bacteria, fungi, and protozoa. Chlamydiapneumoniae with the cardiovascular system cardiovascular system: see circulatory system.
System of vessels that convey blood to and from tissues throughout the body, bringing nutrients and oxygen and removing wastes and carbon dioxide. . Just as Helicobacter pylori Helicobacter pylori
A gramnegative rod-shaped bacterium that lives in the tissues of the stomach and causes inflammation of the stomach lining.
Mentioned in: Indigestion, Ulcers
Helicobacter pylori has been implicated in gastric ulcers, there is a theory that microorganisms in the cardiovascular system could cause systemic infection, leading to plaque development and atherosclerosis. "We study the chlamydial chlamydial
pertaining to members of the family Chlamydiaceae.
abortion in cows, ewes, sows and goat does caused by Chlamydophila abortus and C. pecorum. See enzootic abortion of ewes. proteome and look at changes in it and how that might be reflected in the production of products that then stimulate atherosclerotic lesions," says Pierce.
Like all proteomics practitioners, he is enthusiastic about the possibilities that lie ahead in the field. "The infinite variety of states of proteins in the cell will give us the opportunity, more so than in genomics, to uncover new mechanisms in biology," he says. "In certain aspects you're looking at a dynamic system that is growing and changing, and we can actually 'catch biology happening.' And because of that, we'll find new mechanisms and be more likely to develop new ways to look at mechanisms or affect them."
NIEHS researchers are also delving into the field of proteomics. Merrick and the NCT Proteomics Group, working in partnership with Kenneth B. Tomer, who heads the NCT Mass Spectrometry Group, aid the center's efforts to discover more and better information about the adverse effects of chemicals and toxic compounds. Merricks group works mainly in expression proteomics experiments with animals. "We want to be able to evaluate the effects of chemicals in experimental animals under the most controlled conditions possible," he says, "so that we can separate out the true effects of the chemicals from the nonspecific nonspecific /non·spe·cif·ic/ (non?spi-sif´ik)
1. not due to any single known cause.
2. not directed against a particular agent, but rather having a general effect.
1. 'noise' effects that you always see with these types of technologies."
Among other projects, Merrick's group uses SELDI SELDI Surface Enhanced Laser Desorption/Ionization to examine the mechanisms of action of two key proteins involved in cell growth and cell death--p53 and NFKB. "p53 is often regarded as one of the 'master switches' of life and death and cell growth within the cell," says Merrick. "In the same sense, NFKB is a 'master switch' for inflammation and immune response immune response
An integrated bodily response to an antigen, especially one mediated by lymphocytes and involving recognition of antigens by specific antibodies or previously sensitized lymphocytes. . In these two proteins, we're looking for specific markers, specific states that would distinguish them in terms of their being activated or deactivated in association with a particular disease state or state of cellular function."
In the 3 April 2001 issue of Biochemistry, the Merrick and Tomer groups reported the results of their MS research on p53. They were able to isolate the entire protein from the cell for comprehensive MS analysis for the first time, in an effort to shed light on how the fine structure of the protein influences its ability to control cellular life and death. It has long been suspected that phosphorylation phosphorylation, chemical process in which a phosphate group is added to an organic molecule. In living cells phosphorylation is associated with respiration, which takes place in the cell's mitochondria, and photosynthesis, which takes place in the chloroplasts. , a type of post-translational modification, may be involved in the process. The group discovered six specific phosphorylation sites on the protein, one of which, Ser(315), was particularly phosphorylated. Unraveling the mystery of how p53 exerts its "master switch" control over cellular mortality would be an important advance in biology, and this study constitutes a major step toward that discovery.
The group is also undertaking a number of clinical projects in neurodegenerative disease Neurodegenerative disease
A disease in which the nervous system progressively and irreversibly deteriorates.
Mentioned in: Amnesia and cancer, taking advantage of access to blood and serum samples from NIEHS epidemiologic activities. Serum proteomic analysis has much to tell, according to Merrick. "The soluble proteins within serum or plasma can be reflective of a disease state, or of toxicity or injury to a particular disease site, whether it be in heart disease or liver disease Liver Disease Definition
Liver disease is a general term for any damage that reduces the functioning of the liver.
The liver is a large, solid organ located in the upper right-hand side of the abdomen. ," he explains. "So proteomics can shine in analysis of serum or plasma because of the nature of disease, in that you may either have release of a biomarker from a particular organ, or there may be indications of a repair process going on with serum or plasma that you can [detect in the bodily fluids]."
Toxicoproteomics can also be used to discover previously unknown sites within the cell, says Merrick. "When you're dealing with proteins, you're dealing with time and space," he says. "These proteins occupy a certain amount of space within tissues or cells, and to be able to isolate these subcellular portions that are important targets of either therapy or of toxicity is an area where proteomics can make a special contribution."
Calcium, Oxidation, and Aging
At the Pacific Northwest National Laboratory The Pacific Northwest National Laboratory (PNNL) is one of nine United States Department of Energy (DOE) multiprogram national laboratories. The laboratory
PNNL is located in Richland, Washington, and operates a marine research facility in Sequim, Washington. in Richland, Washington, researcher Thomas Squier practices proteomics as part of the laboratory's systems biology approach, which integrates information from all of the -omics disciplines to first determine how a cell functions and then develop predictive models. The lab's Biomolecular Systems Initiative, which includes its proteomics work, is part of the U.S. Department of Energy's Genomes to Life program, which aims at uncovering biologic solutions for major environmental issues such as clean energy production, removal of excess carbon dioxide carbon dioxide, chemical compound, CO2, a colorless, odorless, tasteless gas that is about one and one-half times as dense as air under ordinary conditions of temperature and pressure. from the atmosphere, and remediation of contaminated contaminated,
v 1. made radioactive by the addition of small quantities of radioactive material.
2. made contaminated by adding infective or radiographic materials.
3. an infective surface or object. environments.
Squier's group concentrates on analyzing calcium regulation in cells and how oxidative stress can trigger adaptive mechanisms, resulting in post-translational modifications to key calcium sensor proteins. Calcium maintains a 10,000-fold gradient in cellular systems and is the key player in the signaling that modulates energy metabolism. Changing calcium levels are responsible for much of the cell's sensing of the environment. By identifying post-translational modifications of the key calcium sensor proteins (changes such as methionine methionine (mĕthī`ənēn), organic compound, one of the 20 amino acids commonly found in animal proteins. Only the L-stereoisomer appears in mammalian protein. oxidation and protein nitration), potentially important new biomarkers of exposure can be isolated. For example, the lab has identified the calcium signaling protein calmodulin calmodulin /cal·mod·u·lin/ (kal-mod´u-lin) a calcium-binding protein present in all nucleated cells; it mediates a variety of cellular reponses to calcium.
n. as a major target of oxidative stress, as described in the January 2003 issue of Chemical Research in Toxicology. This discovery could contribute significantly to understanding of adaptive cellular responses to environmental exposures, particularly in how repair and maintenance systems are triggered.
Aging is an important factor in cellular adaptive ability as well. Aging is a major risk factor for most diseases and for sensitivity to environmental exposures, says Squier. "In aging," he says, "the key regulatory proteins regulatory proteins
1. proteins which regulate the contraction of muscle by controlling the interaction of myosin and actin. Calcium is an essential component of this reaction. The two proteins are troponin and tropomyosin.
2. get oxidized oxidized
having been modified by the process of oxidation.
see absorbable cellulose. , and their oxidation slows metabolism down. We speculate that this is an adaptive mechanism to maintain this balance between reactive species and cell function."
Ultimately, this work on the detection of post-translational modifications of key sensor proteins could lead to the development of microarray-based assays that would rapidly analyze a person's antioxidant antioxidant, substance that prevents or slows the breakdown of another substance by oxygen. Synthetic and natural antioxidants are used to slow the deterioration of gasoline and rubber, and such antioxidants as vitamin C (ascorbic acid), butylated hydroxytoluene status. In terms of applications, Squier says, being able to quickly identify changes in protein expression and discern what post-translational modifications happen is going to provide a very high level of information about the health of an individual, which in turn could lead to greatly enhanced medical treatment.
Proteomics Initiatives and Databases
Considering the enormous challenges and opportunities posed by proteomics, it's unsurprising that there are several collaborative proteomics initiatives under way at the national and international levels. Perhaps the best known of these campaigns is the Human Proteome Organisation (HUPO HUPO Human Proteome Organisation ), an international body intended to encourage large-scale analysis of the human proteome. HUPO seeks to consolidate national and regional proteome organizations into a worldwide research consortium. "Proteomics cannot be fully grasped and developed without a major international organized effort, which HUPO intends to facilitate," says Samir Hanash, the organization's president and a professor of pediatrics at the University of Michigan (body, education) University of Michigan - A large cosmopolitan university in the Midwest USA. Over 50000 students are enrolled at the University of Michigan's three campuses. The students come from 50 states and over 100 foreign countries. .
HUPO has established a goal of mapping 5,000 human proteins, and is coordinating and standardizing research in a variety of pertinent areas. Its major projects include analysis of specific regions of the body--the Human Plasma Proteome Project, the Human Liver Proteome Project, and the Human Brain Proteome Project. Another major project is the Proteomics Standards Initiative, which aims to define community standards for presentation of proteomics data. Still other projects include initiatives involving new proteomics technologies, cell models and tissue, bioinformatics, and the development of a collection of standardized, high-quality antibodies for every human protein. HUPO's shared resources, data, and establishment of standardized protocols and reporting guidelines should contribute substantially to the understanding of disease processes and chemical exposures.
The Human Proteomics Initiative seeks to comprehensively annotate annotate - annotation all known human proteins, which means parsing See parse.
parsing - parser out each protein's function, domain structure, subcellular location, post-translational modifications, variants, similarities to other proteins, and protein sequence polymorphisms. This ambitious project is sponsored by the Swiss Institute of Bioinformatics and the European Bioinformatics Institute The European Bioinformatics Institute (EBI) is a centre for research and services in bioinformatics, and is part of European Molecular Biology Laboratory (EMBL). It is a pioneer of novel and developmental bioinformatics research. , the keepers of one of the most widely used protein sequence databases, Swiss-Prot.
Virtually all proteomics experiments involve accessing and querying protein databases as an integral step in the process, allowing the identification and characterization of detected proteins and peptides. That vital link between data and knowledge should be greatly enhanced by the establishment of the United Protein Database, or UniProt. Funded in October 2002 by a three-year, $15 million grant subsidized primarily by the National Human Genome Research Institute along with five other institutes and centers of the NIH "Not invented here." See digispeak.
NIH - The United States National Institutes of Health. , UniProt will combine the resources of Swiss-Prot and two other major annotated protein databases, the European Bioinformatics Institute's TrEMBL and the Protein Information Resource's Protein Sequence Database. By January 2005, UniProt will be fully operational and available to all users free of charge.
In the world of proteomics, the main action often centers around interactions, molecular complexes, and pathways. The Biomolecular Interaction Network Database serves as a comprehensive, publicly accessible repository for data and software tools related to those critical biomolecular functions. The database is administered by blueprint WORLDWIDE, a nonprofit organization Nonprofit Organization
An association that is given tax-free status. Donations to a non-profit organization are often tax deductible as well.
Examples of non-profit organizations are charities, hospitals and schools. cofounded for that purpose by IBM (International Business Machines Corporation, Armonk, NY, www.ibm.com) The world's largest computer company. IBM's product lines include the S/390 mainframes (zSeries), AS/400 midrange business systems (iSeries), RS/6000 workstations and servers (pSeries), Intel-based servers (xSeries) and MDS MDS,
n See temporomandibular pain-dysfunction syndrome.
MDS 1 Maternal deprivation syndrome, see there 2 Myelodysplastic syndrome, see there Proteomics of Toronto, Canada.
Another important resource in the protein database arena has recently been launched as a joint project between researchers at The Johns Hopkins University Johns Hopkins University, mainly at Baltimore, Md. Johns Hopkins in 1867 had a group of his associates incorporated as the trustees of a university and a hospital, endowing each with $3.5 million. Daniel C. in Baltimore, Maryland, and the Institute of Bioinformatics in Bangalore, India. By the end of 2003, the Human Protein Reference Database This article reads like a news release, or is otherwise written in an overly promotional tone.
Please help [ rewrite this article] from a to be less promotional, per Wikipedia . is expected to contain comprehensive entries on 10,000 human proteins, including domain architecture, post-translational modifications, interaction networks, and disease associations. The information in this database has been manually extracted from the literature by biologists who read, interpret, and analyze the published data.
To spur the progress of clinical proteomics, in 2002 the National Heart, Lung, and Blood Institute National Heart, Lung, and Blood Institute,
n.pr established in 1948, this division of the National Institutes of Health is responsible for research and education on cardiovascular, pulmonary, systemic diseases, and sleep disorders. launched a major initiative that created 10 special centers of proteomics research at academic institutions across the country. The seven-year, $157 million program is designed to accelerate the development of innovative technologies to characterize healthy and diseased heart, lung, blood, and sleep processes. Says the institute's proteomic program administrator Susan Old, "This should speed the delivery of potential new clinical applications from research into practice." The centers will investigate protein profiling, interactions, and post-translational modifications as they relate to a variety of conditions, including cardiovascular disease, auto-immune disease, airway inflammation, and cystic fibrosis cystic fibrosis (sĭs`tĭk fībrō`sĭs), inherited disorder of the exocrine glands (see gland), affecting children and young people; median survival is 25 years in females and 30 years in males. .
The development of better tools and better knowledge of structural proteomics is the goal of the Protein Structure Initiative, a 10-year project funded by the National Institute of General Medical Sciences The U.S. National Institute of General Medical Sciences is one of the National Institutes of Health (NIH), the principal biomedical research agency of the Federal Government. and launched in 2000 with an open-ended budget. Currently in its pilot phase, the initiative aims to determine the three-dimensional structure of 10,000 unique proteins, while dramatically reducing the time and costs involved in the process. By 2005, each of nine centers is expected to be able solve the structure of 100-200 proteins annually. By grouping proteins into structural families, "the initiative will develop a catalog of all the protein structures that exist in nature," said Marvin Cassman, then director of the National Institute of General Medical Sciences, at the time the initiative was launched. "We expect that it will yield major biological findings that will improve our understanding of health and disease."
Proteomics data are also expected to play a large role in the Chemical Effects in Biological Systems (CEBS CEBS Committee of European Banking Supervisors
CEBS Certified Employee Benefit Specialist
CEBS Chemical Effects in Biological Systems
CEBS Church of England Boys Society
CEBS Charles Edward Brooke School (UK) ) knowledge base being developed by the NCT. CEBS is designed to exhaustively document the toxic effects of chemicals in the environment and will be fully searchable by compound, structure, toxicity, pathology, gene, gene group, single-nucleotide polymorphism polymorphism, of minerals, property of crystallizing in two or more distinct forms. Calcium carbonate is dimorphous (two forms), crystallizing as calcite or aragonite. Titanium dioxide is trimorphous; its three forms are brookite, anatase (or octahedrite), and rutile. , pathway, and network. The knowledge base will be accessible by the public, and will be a major contributor to progress in the fields of toxicoproteomics and toxicogenomics.
Scratch your average proteomics investigator and you will reveal an optimist just under the sober scientific surface. The excitement is palpable; the visions are grand. But all in the field agree that for proteomics to fulfill its lofty promise, certain key developments must take place, several of which are well on their way to fruition.
Technologic progress must continue and accelerate. Many in the field are anxious to see the replacement of the notoriously laborious 2DE method of protein separation with a more automated, high-throughput approach, such as antibody microarrays or isotope-coded affinity tags. All wish for further improvements and refinements in MS equipment and bioinformatics, as well as development of other technologies that could contribute to progress in the field.
"I think MS is becoming increasingly powerful, and we haven't yet realized the full power of these tools," says Liebler. "On the other hand, I think MS-based proteome analyses will give way to other kinds of less high-tech approaches, using perhaps some variants of array technologies: arrays of antibodies or aptamers [basic nucleic acid nucleic acid, any of a group of organic substances found in the chromosomes of living cells and viruses that play a central role in the storage and replication of hereditary information and in the expression of this information through protein synthesis. equivalents of antibodies] and perhaps small molecules that recognize proteins--little lab-on-a-chip devices that would be suitable for analysis of some components of proteins." There are a lot of competing technologies, he says, and it's hard to say what's going to work--"but if the last ten years have taught us anything, it's that we should be prepared to be surprised, regularly."
Philosophically, practitioners are confident that the knowledge gleaned from proteomics will ultimately converge and integrate with advances in the other--omics fields to evolve into a more holistic systems biology discipline with the ability to understand the processes and mechanisms of life in a truly global fashion. "We tend to think of ourselves as proteomics people, or genomics people, or lipids people, and in fact the cell and tissue only exist because all of these things are integrated," says Caprioli. "How these things all relate to one another is what's going to give us the key [to a more comprehensive understanding of systems biology]."
Researchers believe that proteomics will begin to make a tangible difference in medicine and environmental health quite soon. Merrick, for example, believes that within five years, there will be perhaps two or three key public databases that will offer access to gene and protein expression experiments that are done in a standardized way, and researchers will be able to query those databases for use in predicting human health responses to various environmental interactions. In 10 years, he says, "I believe that proteomics will be able to go right into the clinic, in terms of diagnostics and evaluation of blood and serum in a way that clinical chemistry can't approach or compete with. Typically, when you get blood drawn, you get maybe twenty or thirty analyses.... I think in the future this will just be dwarfed by the amount of useful information that will be derived at the proteomic level."
Petricoin is even less guardedly optimistic. He predicts that in five years "a patient will be able to have a pathophysiological portrait performed by high-throughput protein-based technologies that can read out hundreds of thousands of end points at once, and be able to provide the clinician a snapshot of what's going on What's Going On is a record by American soul singer Marvin Gaye. Released on May 21, 1971 (see 1971 in music), What's Going On reflected the beginning of a new trend in soul music. in that organism." Within a decade, he says, will come the development of high-throughput proteomics coupled with artificial intelligence-type systems, with nanotechnology, and even with nano-intelligence systems, allowing clinicians to harvest information and deliver tailored therapeutics based on what's happening in the serum, plasma, and tissue of any given patient who visits the doctor. "It's really going to revolutionize the way in which molecular medicine is performed," says Petricoin. "It's going to happen."
Groups and Initiatives
Human Proteome Organisation (HUPO) http://www.hupo.org/
An international research consortium intended to encourage large-scale analysis of the human proteome
Human Proteomics Initiative http://www.expasy.org/sprot/hpi/
Joint effort of the Swiss Institute of Bioinformatics and the European Bioinformatics Institute that seeks to comprehensively annotate all known human proteins
National Heart, Lung, and Blood Institute Proteomics Initiative
A seven-year, $157 million program to accelerate the development of innovative technologies to characterize healthy and diseased heart, lung, blood, and sleep processes in 10 special centers of proteomics research across the country
Protein Structure Initiative http://www.structuralgenomics.org/
A 10-year project funded by the National Institute of General Medical Sciences to determine the three-dimensional structures of 10,000 unique proteins, while dramatically reducing the time and costs involved in the process
Biomolecular Interaction Network Database http://www.blueprint.org/
A comprehensive, publicly accessible repository administered by blueprint WORLDWIDE for data and software tools related to critical biomolecular functions
Chemical Effects in Biological Systems Knowledge Base http://www.niehs.nih.gov/nct/cebs-htm
National Center for Toxicogenomics database that will exhaustively document the toxic effects of chemicals in the environment and be fully searchable by compound, structure, toxicity, pathology, gene, gene group, single-nucleotide polymorphism, pathway, and network
Human Protein Reference Database http://www.hprd.org/
Joint project of The Johns Hopkins University and the Institute of Bioinformatics that is expected to eventually contain comprehensive entries on 10,000 human proteins, including domain architecture, post-translational modifications, interaction networks, and disease associations
Protein Sequence Database http://pir.georgetown.edu/pirwww/dbinfo/pirpsd.html
A comprehensive annotated protein sequence database in the public domain, maintained by the Protein Information Resource, that contained more than 283,000 entries as of November 2003
A curated protein sequence database developed by the Swiss Institute of Bioinformatics and the European Bioinformatics Institute that strives to provide a high level of annotation, a minimal level of redundancy, and high level of integration with other databases
A database maintained by the European Bioinformatics Institute that contains the translations of all coding sequences present in the European Molecular Biology molecular biology, scientific study of the molecular basis of life processes, including cellular respiration, excretion, and reproduction. The term molecular biology was coined in 1938 by Warren Weaver, then director of the natural sciences program at the Rockefeller Laboratory's Nucleotide Sequence Database that are not yet integrated into Swiss-Prot
United Protein Database http://wvvw.uniprot.org/
With $15 million in funding from six NIH institutes and centers, will combine the resources of Swiss-Prot, TrEMBL, and the Protein Sequence Database.
RELATED ARTICLE: Pieces of the proteomics puzzle.
Proteomics encompasses several different subdisciplines, each with its own unique approach and its own contribution to the overall quest to glean knowledge from the proteome.
In expression (or profiling) proteomics, researchers seek to discover and quantify significant differences in the totality of expressed proteins between known samples--often diseased versus nondiseased or exposed versus unexposed. These differences appear as patterns that can have a very high degree of predictive power, whether the proteins in the pattern are identified (as some experts contend is necessary) or remain unidentified (which others argue is sufficient). Expression proteomics studies yield hypotheses that are then confirmed or refuted by other methods. Clinical proteomics investigations, which seek to apply proteomics knowledge directly to medical practice, typically employ expression proteomics methods.
Functional proteomics encompasses a wide variety of studies involving subsets of proteins. These studies seek to analyze and characterize specific functions, including signaling pathways, interactions, disease mechanisms, and biomarkers of disease or environmental exposures. In this field, hypotheses are tested rather than developed, and protein identification is vital to success.
Structural proteomics concentrates on mapping the structure of protein complexes or those proteins present in a specific cellular organelle organelle /or·ga·nelle/ (or?gah-nel´) a specialized structure of a cell, such as a mitochondrion, Golgi complex, lysosome, endoplasmic reticulum, ribosome, centriole, chloroplast, cilium, or flagellum. . Such information can provide valuable insights into cellular architecture, which greatly influences cellular function. X-ray crystallography and structural modeling by computational biology are the main methods utilized to unravel these extremely complicated systems.
In toxicoproteomics, the full range of proteomics methods and technologies are used in efforts to uncover the cellular and subcellular mechanisms at work in response to xenobiotic exposures. Researchers in this area are particularly interested in discovering biomarkers of exposure.