Loading...

Daily news and top headlines for life science research professionals

FREE Email Newsletter View Sample


Characterizing The Tree Of Life

Featured In: Genomics | Environmental | Health

Thursday, October 2, 2008

See today's top life science stories and headlines - Sign up now!

newsvine diigo google
slashdot
Share
Loading...
by Gina Shaw

Figure 1. Preparing samples for sequencing.
First, we sequenced the human genome. Now, the human proteome is firmly in the crosshairs of science, although since it is exponentially larger than the genome, it will take much longer for all initial sequencing to be done. Next on the agenda: how about the human microbiome?

Microbial cells outnumber human cells by a factor of about ten to one, but these microbial communities have been almost completely ignored by research. What do they do? How do they affect our growth and development, disease and wellness? With the exception of a few nasty pathogens like E. coli , the short answer is: we really don't know.

In fiscal 2007, the National Institutes of Health began funding projects aimed at building a data resource for the new Human Microbiome Project, including sequencing the genomes of 200 microbes that have been isolated from the human body. Four sequencing centers — at Baylor, the Broad Institute, the J. Craig Venter Institute, and Washington University — will generate the data for this part of the project.

The rise of metagenomics
Still in its infancy, the Human Microbiome Project will build on a foundation constructed largely by environmental scientists and microbiologists who have been studying microbial genomic diversity in situ , rather than by cultivation-based methods — a technique known as metagenomics.

Figure 2. A comparison view generated using MEGAN2, comparing metagenomic data from three ancient DNA studies published in the last couple of years: Mammoth, Cave Bear and Neanderthal. Click to enlarge.
Instead of using conventional DNA analysis — culturing identical cells in a lab — metagenomics sequences genetic material en masse from uncultured, environmental samples, allowing researchers to study organisms that are not easily cultured in laboratories (that's more than 99% of microbes), and revealing an amazing amount of microbial diversity that had previously gone unrecognized.

In the April 3, 2008 edition of Nature , Elizabeth Dinsdale et al., published a metagenomic mother lode — 15 million sequences from 45 distinct microbiomes and, for the first time, 42 distinct viromes. The multi-center team, based at San Diego State University, showed strongly discriminatory metabolic profiles across environments.

Another fascinating finding involved phage—viruses that infect bacteria. "Phage are even more numerous than bacteria, and our analysis showed that phage were also carrying a wider array of genes than first thought. That means because phage infect bacteria, put their DNA in and make bacteria replicate, they're introducing new genetic material into the bacteria population and may control the way bacteria are acting in a community. So metagenomic analysis tells us that the influence of phage is much greater than we first thought."

Massively-parallel pyrosequencing
What did Dinsdale's team used 454 Sequencing, the massively-parallel pyrosequencing system from 454 Life Sciences, to generate these findings.

"The biggest advantage is that we don't have to clone the sequences," says Dinsdale. "We take the DNA and it goes directly into sequencing without the cloning step, actually placing the sequences onto a bead and replicating them within an individual well. This saves time, which is hugely important given the amount of data we have — one sample is about 300,000 sequences." It took 3.5 weeks of computer time just to do the initial blast against the NCBI database for the 15 million sequences covered in the Nature paper.

The fact that the 454 system does not require cloning also allows Dinsdale and her colleagues to sequence pieces of the bacterial genome that just won't clone for various reasons. "There are some parts of the genome that you would never see from a cloned sequence," she says.

An often-quoted limitation of the 454 system, short sequence reads, doesn't bother Dinsdale. "First, their sequence reads have gotten longer, and our paper has shown we can obtain a good description of the microbial environment using a shorter sequence read in any case. Without having to grow the bacteria in many different cultures — a high carbon environment, a high silicon environment, and so on — we were able to obtain a fairly accurate picture of the microbial genes carried across the whole community, and interpret what sorts of functions they would be able to do."

Signature genes
Figure 3. MEGAN’s microbial attributes window. Click to enlarge.
The biggest barrier for most metagenomic researchers, Dinsdale says, is computing capacity. In less than a year since her team first ran their analysis of their data, the number of characterized metagenomes exploded from 87 to more than 800. "The amount of data will keep increasing, because it's such a window into the microbial world that hasn't been seen before."

Publicly accessible, computer-based tools aim to help with some of that data overload. One of them is Signature (http://www.cmbi.ru.nl/signature), a Web server that allows the input of genes from an unknown source, such as a metagenomics sample.

"The Web server then tells you if your input contains "signature genes" for clades in the tree of life, signature genes are only found in one clade, but within that clade are widely distributed," explains one of the lead developers of Signature, Bas E. Dutilh, of the Center for Molecular and Biomolecular Informatics at Nijmegen Center for Molecular Life Sciences, Radboud University, Nijmegen Medical Centre, in the Netherlands. "If you find many signature genes for a certain clade, it is likely that a species from that clade is represented in your sample."

The signature genes on which the server is based — 8,362 signature genes specific for 112 prokaryotic taxa — can answer phylogenetic questions on the basis of gene content even where complete genomes are not available. "By now, there are so many completed genomes available, that together, they have some predictive power for the ones that are not yet sequenced," explains Dutilh. If all the complete genomes that are available agree that a certain gene is a signature for a clade, then it is highly unlikely that we will find it in a species outside that clade. So conversely, if we find this signature gene in a certain species, then it is highly likely that it belongs in that clade. As the signature genes approach can be used for incomplete genomes, it opens the possibility of using the phylogenetic power in gene content in cases where we can not obtain a complete genome sequence, like for "un-culturable" species or metagenomic samples. In principle, finding one signature gene should already be enough."

Most methods that assess species distribution in a sequence sample do so on the basis of universal marker genes that contain a good phylogenetic signal. But, Dutilh notes, while there are only a few good phylogenetic marker genes, there are thousands of signature genes. "By finding which signature genes taxonomically identify a sample, Signature immediately pinpoints the genes that characterize the clade. So Signature can also be useful to find out what genes actually characterize a specific group of species."

For example, Dutilh's team has used the signature gene approach to analyze the genome of Kuenenia stuttgartiensis, a planctomycete that oxidizes ammonium anaerobically. By testing which clade shares most signature genes with Kuenenia and the other Planctomycetes, they discovered that they are most closely related to the Chlamydiae "One of the signature genes we discovered for this Planctomycetes-Chlamydiae superphylum is a gene that is highly similar to the 60-kDa cysteine-rich outer membrane protein of Chlamydiae, and they share more properties of their cell envelopes," says Dutilh." So Signature is not only useful to find the taxonomic relatives of a new species, but it also immediately identifies those genes that are characteristic of the clade."

Analysis of metagenomic datasets
A complementary tool is MEGAN (MEtaGenome ANalyzer), a computer program that allows optimized analysis of large metagenomic datasets in order to identify the species they contain. Co-developers Daniel Huson, of the Center for Bioinformatics at Tübingen University in Germany, and Stephan Schuster, of Penn State University, designed MEGAN to compare DNA sequence fragments from an environmental sample with gene sequences from GenBAnk in NCBI.

Fragments of DNA from an environmental sample, such as ocean waters or soil, are compared against databases of known DNA sequences using BLAST or another algorithmic bioinformatics tool to assemble the segments into discrete comparable sequences. MEGAN is then used to compare the resulting sequences with gene sequences from GenBank in NCBI. Originally designed to investigate the DNA of a mammoth found in the Siberian permafrost, and described in a 2007 paper in Genome Research MEGAN has since garnered hundreds of registered users. "The main use is probably to obtain a first analysis of a new dataset, which is then subsequently followed up by more detailed analyses. In some sequencing centers, MEGAN is routinely used to check whether a sequence project contains the types of sequences that it supposed to do," says Huson.

Huson and his team are now at work on MEGAN 2.0, which will aim at comparative analysis of multiple datasets. "Also, we are about to release a companion program MetaSim, that can be used to simulate a metagenome sequencing project, controlling both the exact mix of genomes and also the type of sequencing technology used," he says.

Analysis of metagenomic data sets is still in its infancy, says Huson. "So a tool like MEGAN, which aims particularly at interactive exploration of data, rather than trying to cook a final answer, will help people to discover and understand how such an analysis should best be done."

A long way to go
The main message of metagenomics so far, according to Huson, is that the tree of life has a lot more branches than we thought. "Only about 300,000 species are represented by at least one gene sequence in the NCBI databases; however, nearly two million species have already been named and metagenome projects are confirming that the true spectrum of different types of organisms is probably much, much larger.
Join the Discussion
Rate Article:  Average 0 out of 5
register or log in to comment on this article!

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1

Research Exchange

Three Tips to Help Manage Customization in Genomics LIMS Implementations

Nov 21 2011

Labs must consider three elements in evaluating genomics laboratory information management system (LIMS) to support the ever-changing workflow characteristics of next-generation sequencing.

How to Build an Integrated Microscopy System for Live Cell Mechanotransduction Studies

How to Build an Integrated Microscopy System for Live Cell Mechanotransduction Studies

Aug 2 2011

A new integrated microscopy system allows scientists to simultaneously stimulate and image live cell response in real-time.

Tips for Reducing Static Electricity

Tips for Reducing Static Electricity

Aug 1 2011

Static electricity can affect automated instrumentation within the lab.

Microscopes for the Non-Microscopist: Multidisciplinary Research Using Optical Imaging

Microscopes for the Non-Microscopist: Multidisciplinary Research Using Optical Imaging

Jul 5 2011

High quality microscopy is increasingly used by scientists in new areas of research.

Evaluation of a New Nano-Type UV-Vis Spectrophotometer

Mar 3 2011

Analysis of one- to four-microliter size samples for nucleic acids has become routine in many life science laboratories. However, until now, available instruments require considerable manipulation of the instrument and sample; some require manually recording the data. The user must typically lower and raise the arm manually, then wipe the sample manually from the target after each analysis. And fiberoptics used in some of these instruments are subject to deterioration.

Production of Recombinant Proteins and Monoclonal Antibodies in Hollow Fiber Bioreactors

Jan 25 2011

While well-understood, robust and convenient, classical batch-style 2-D culture on non-porous supports or 3-D suspension culture in other devices are really not very biologically relevant models. Cell culture conditions can affect the quality of the antibody or protein produced.

Selecting Robots for Use in Drug Discovery and Testing

Dec 6 2010

Drug discovery and testing, with their need for speed, repeatability and verification, are ideally suited to benefit from robot automation. It is therefore not surprising that robots have been at the forefront of automation developments in both these areas.

HP Scalable Network Storage Systems for Life Sciences

Sep 13 2010

Life sciences research today is advancing exponentially, each step bringing us closer to the realization of truly personalized medicine–preventive care and treatments designed specifically for each individual. In the near future, PCPGM healthcare researchers expect to be able to use predictive genetic testing to create custom treatment plans for individuals and deliver dramatic improvements over today’s one-size-fits-all approach. But research capabilities are only part of the equation; current storage and operating capacities must also evolve to accommodate ever-expanding amounts of data before the goal of personalized medicine can be realized.

Using the Tecan Genesis Workstation to Automate a Cytometric Bead Array (CBA) Immunoassay

Mar 11 2010

The poster describe the process involved in automating a Cytometric Bead Array (CBA) immunoassay developed to measure relative concentrations of serum antibodies against Tetanus (TT), Sperm Whale Myoglobin (SWM) and Keyhole Limpet Hemocyanin (KLH) in KLH-immunized volunteers.

Ensuring Quality in Assays Performed with Automated Liquid Handlers

Feb 2 2010

The focus of this presentation is to highlight the need of ensuring quality in important assays performed with automated liquid handlers. Nearly all assays performed within a laboratory are volume-dependent. In turn, all concentrations of biological and chemical components in these assays, as well as the associated dilution protocols, are volume-dependent. Because analyte concentration is volume-dependent, an assay’s results might be falsely interpreted if liquid handler variability and inaccuracies are unknown or if the system(s) go unchecked for a long period.

Inkjet System for Protein Crystallography

Feb 1 2010

X-ray crystallography is used routinely by scientists to obtain the three dimensional structure of a biological molecule of interest.Such information can be used to determine how a pharmaceutical interacts with a protein target and what changes might improve functionality. However, the crystallization of macromolecules still remains a serious hindrance in structural determination despite impressive advances in screening methods and technologies.

Attention Deficit & Hyperactivity in a Drosophila Memory Mutant

Attention Deficit & Hyperactivity in a Drosophila Memory Mutant

Nov 9 2009

Action selection is modulated by external stimuli either directly or via memory retrieval. In a constantly changing environment, animals have evolved attention-like processes to effectively filter the incoming sensory stream. These attention-like processes, in turn, are modulated by memory. The neurobiological nature of how attention, action selection and memory are inter-connected is unknown. We describe here new phenotypes of the memory mutant radish in the fruit fly Drosophila.

Red Meat Consumption and Mortality: Results From 2 Prospective Cohort Studies

Mar 16

BACKGROUND: Red meat consumption has been associated with an increased risk of chronic diseases. However, its relationship with mortality remains uncertain. METHODS: We prospectively observed 37 698 men from the Health Professionals Follow-up Study (1986-2008)...

Structural analysis of eyespots: dynamics of morphogenic signals that govern elemental positions in butterfly wings.

Mar 15

ABSTRACT: BACKGROUND: To explain eyespot colour-pattern determination in butterfly wings, the induction model has been discussed based on colour-pattern analyses of various butterfly eyespots. However, a detailed structural analysis of eyespots that can serve as...

Prokariotic Cell Collection in Denmark

Nov 6 2009

I would like to know about a prokariotic cell collection in Denmark. Is there a cell bank in this country? I need a Lactobacillus strain for a fermentation assay and this information about the bank is very helpful for me.

Request for Entries

Oct 16 2009

Ask the Experts is your chance to get the answers to questions on applications, materials, methods, processes, and technologies. Email you question to bst_web@advantagemedia.com, and the editors of Bioscience Technology will find an appropriate expert to answer it. Watch this space in the future to see the questions your colleagues are posting.          

STAY INFORMED: SUBSCRIBE TO

Magazine and E-mail Newsletters

Loading...
Loading...

Free Life Science Industry
Subscriptions

Magazine

wireless week

Newsletters

newsletters

Sign up now



MULTIMEDIA

Video:

Viewing SureFocus Slides

Jun 11 2010

A demonstration of SureFocus Microscope Slides in the review of AFB Smears. SureFocus Slides are a patent-pending breakthrough in tuberculosis detection, as their fluorescent staining circle remains visible during review, Fluorescence Microscopy.

Podcasts:

Allen Institute for Brain Research

Allen Institute for Brain Research

Oct 14 2009

Discussed in this interview are both the mouse brain project and the human cortex project with an emphasis on the importance of these projects to neuroscience research.

Top Stories and Headlines
EVERY DAY!

FREE Email Newsletter