Title: The Evolution of Microbes and Their Genomes: A Phylogenomic Perspective
1The Evolution of Microbes and Their Genomes A
Phylogenomic Perspective
- Jonathan A. Eisen
- October 3, 2009
2Nothing in biology makes sense except in the
light of evolution. T. Dobzhansky (1973)
3Eisen Lab - Phylogenomics of Novelty
Origin of New Functions and Processes
Genome Dynamics
- Evolvability
- Repair and recombination processes
- Intragenomic variation
- New genes
- Changes in old genes
- Changes in pathways
Species Evolution
- Phylogenetic history
- Vertical vs. horizontal descent
- Needed to track gain/loss of processes, infer
convergence
4Phylogenomic Analysis
- Evolutionary reconstructions greatly improve
genome analyses - Genome analysis greatly improves evolutionary
reconstructions - There is a feedback loop such that these should
be integrated
556.1 The Global Ecosystems Compartments are
Connected by the Flow of Elements
626.23 Some Would Call It Hell These Archaea
Call It Home
7(No Transcript)
8Human commensals
9Deep Sea Ecosystems
10Fleischmann et al. 1995
11Whole Genome Shotgun Sequencing
12Assemble Fragments
13From http//genomesonline.org
14Major Microbial Sequencing Efforts
- Coordinated, top-down efforts
- Fungal Genome Initiative (Broad/Whitehead)
- Gordon and Betty Moore Foundation Marine
Microbial Genome Sequencing Project - Sanger Center Pathogen Sequencing Unit
- NHGRI Human Gut Microbiome Project
- NIH Human Microbiome Program
- White paper or grant systems
- NIAID Microbial Sequencing Centers
- DOE/JGI Community Sequencing Program
- DOE/JGI BER Sequencing Program
- NSF/USDA Microbial Genome Sequencing
- Covers lots of ground and biological diversity
15Genome Sequences Have Revolutionized Microbiology
- Predictions of metabolic processes
- Better vaccine and drug design
- New insights into mechanisms of evolution
- Genomes serve as template for functional studies
- New enzymes and materials for engineering and
synthetic biology
1616
1717
1818
19rRNA Tree of Life
20The Tree is not Happy
21As of 2002
- At least 40 phyla of bacteria
Acidobacteria
Bacteroides
Fibrobacteres
Gemmimonas
Verrucomicrobia
Planctomycetes
Chloroflexi
Based on Hugenholtz, 2002
22As of 2002
- At least 40 phyla of bacteria
- Genome sequences are mostly from three phyla
Acidobacteria
Bacteroides
Fibrobacteres
Gemmimonas
Verrucomicrobia
Planctomycetes
Chloroflexi
Based on Hugenholtz, 2002
23As of 2002
- At least 40 phyla of bacteria
- Genome sequences are mostly from three phyla
- Some other phyla are only sparsely sampled
Acidobacteria
Bacteroides
Fibrobacteres
Gemmimonas
Verrucomicrobia
Planctomycetes
Chloroflexi
Based on Hugenholtz, 2002
24As of 2002
- At least 40 phyla of bacteria
- Genome sequences are mostly from three phyla
- Some other phyla are only sparsely sampled
- Same trend in Archaea
Acidobacteria
Bacteroides
Fibrobacteres
Gemmimonas
Verrucomicrobia
Planctomycetes
Chloroflexi
Based on Hugenholtz, 2002
25Need for Tree Guidance Well Established
- Common approach within some eukaryotic groups
- Many small projects funded to fill in some
bacterial or archaeal gaps - Phylogenetic gaps in bacterial and archaeal
projects commonly lamented in literature
26- NSF-funded Tree of Life Project
- A genome from each of eight phyla
- At least 40 phyla of bacteria
- Genome sequences are mostly from three phyla
- Some other phyla are only sparsely sampled
- Solution I sequence more phyla
Acidobacteria
Bacteroides
Fibrobacteres
Gemmimonas
Verrucomicrobia
Planctomycetes
Eisen, Ward, Badger, Wu, Wu, et al.
Chloroflexi
27The Tree of Life is Still Angry
28Major Lineages of Actinobacteria
29- At least 100 phyla of bacteria
- Genome sequences are mostly from three phyla
- Most phyla with cultured species are sparsely
sampled - Lineages with no cultured taxa even more poorly
sampled - Solution - use tree to really fill gaps
Acidobacteria
Bacteroides
Fibrobacteres
Gemmimonas
Verrucomicrobia
Planctomycetes
Chloroflexi
Well sampled phyla
30http//www.jgi.doe.gov/programs/GEBA/pilot.html
31GEBA Pilot Project Overview
- Identify major branches in rRNA tree for which no
genomes are available - Identify a cultured representative for each group
- Grow gt 200 of these and prep. DNA
- Sequence and finish 100
- Annotate, analyze, release data
- Assess benefits of tree guided sequencing
32(No Transcript)
33Some Lessons From GEBA
34GEBA Lesson 1
- rRNA Tree of Life is a Useful Guide and Genomes
Improve Resolution
35rRNA Tree of Life
36(No Transcript)
37(No Transcript)
38GEBA Lesson 2
- Phylogenetically Guided Selection Can Help
Annotate Other Genomes
39Predicting Function
- Identification of motifs
- Small sequence elements that code for some
general activity (e.g., ATPase) - Homology/similarity based methods
- Identify longer stretches of similarity to genes
with known function - Evolutionary reconstructions
40Evolutionary Functional Prediction
Based on Eisen, 1998 Genome Res 8 163-167.
41Recent Functional Changes
- Phylogenomic functional prediction may not work
well for very newly evolved functions - Screen genomes for genes that have changed
recently - Examples
- Acquisition (e.g., LGT)
- Unusual dS/dN ratios
- Rapid evolutionary rates
- Duplication and divergence
42Non homology functional prediction
- Many genes have homologs in other species but no
homologs have ever been studied experimentally - Non-homology methods can make functional
predictions for these - Example correlated presence/absence of genes
across species
43Most/All Functional Prediction Improves w/ Better
Phylogenetic Sampling
- Better definition of protein family sequence
patterns - Greatly improves comparative and evolutionary
based predictions - Conversion of hypothetical into conserved
hypotheticals - Linking distantly related members of protein
families - Improved non-homology prediction
44GEBA Lesson 3
- Phylogenetically Guided Selection Can Help Study
Uncultured Organisms
45Great Plate Count Anomaly
Microscope
Culturing
Count
Count
46Great Plate Count Anomaly
Microscope
Culturing
Count
Count
ltltltlt
47Great Plate Count Anomaly
DNA
Microscope
Culturing
Count
Count
ltltltlt
48PCR Saves the Day
49Sequencing and uncultured microbes I rRNA
surveys
50rRNA A Phylogenetic Anchor to Determine Whos
Out There
Biology not similar enough
Eisen et al. 1992
51Perna et al. 2003
52Environmental Shotgun Sequencing
53Novel Form of Phototrophy
Beja et al. 2000
54(No Transcript)
55Binning challenge
A B C D E F G
T U V W X Y Z
56Glassy Winged Sharpshooter
- Feeds on xylem sap
- Vector for Pierces Disease
- Potential bioterror agent
57Xylem and Phloem
From Lodish et al. 2000
58Wu et al. 2006 PLoS Biology 4 e188.
59Sharpshooter Shotgun Sequencing
shotgun
Collaboration with Nancy Morans lab
Wu et al. 2006 PLoS Biology 4 e188.
60Sulcia makes vitamins and cofactors
Baumannia makes amino acids
Wu et al. 2006 PLoS Biology 4 e188.
61Binning challenge
A B C D E F G
T U V W X Y Z
62(No Transcript)
63GEBA Lesson 4
- We have still only scratched the surface of
microbial diversity
64Protein Family Rarefaction Curves
- Take data set of multiple complete genomes
- Identify all protein families using MCL
- Plot of genomes vs. of protein families
65(No Transcript)
66MobileMotilityElement?
67Phylogenetic Distribution Novelty 1st Bacterial
Actin Related Protein
Haliangium ochraceum DSM 14365
68Phylogenetic Diversity Sequenced Bacteria
Archaea
69Phylogenetic Diversity with GEBA
70Phylogenetic Diversity Isolates
71Phylogenetic Diversity All
72- At least 40 phyla of bacteria
- Genome sequences are mostly from three phyla
- Most phyla with cultured species are sparsely
sampled - Lineages with no cultured taxa even more poorly
sampled
Acidobacteria
Bacteroides
Fibrobacteres
Gemmimonas
Verrucomicrobia
Planctomycetes
Chloroflexi
Well sampled phyla
Poorly sampled
No cultured taxa
73Uncultured LineagesTechnical Approaches
- Get into culture
- Enrichment cultures
- If abundant in low diversity ecosystems
- Flow sorting
- Microbeads
- Microfluidic sorting
- Single cell amplification
74GEBA Lesson 6
- Need Experiments from Across the Tree of Life too
75As of 2002
- At least 40 phyla of bacteria
Acidobacteria
Bacteroides
Fibrobacteres
Gemmimonas
Verrucomicrobia
Planctomycetes
Chloroflexi
Based on Hugenholtz, 2002
76As of 2002
- At least 40 phyla of bacteria
- Experimental studies are mostly from three phyla
Acidobacteria
Bacteroides
Fibrobacteres
Gemmimonas
Verrucomicrobia
Planctomycetes
Chloroflexi
Based on Hugenholtz, 2002
77As of 2002
- At least 40 phyla of bacteria
- Experimental studies are mostly from three phyla
- Some studies in other phyla
Acidobacteria
Bacteroides
Fibrobacteres
Gemmimonas
Verrucomicrobia
Planctomycetes
Chloroflexi
Based on Hugenholtz, 2002
78As of 2002
- At least 40 phyla of bacteria
- Genome sequences are mostly from three phyla
- Some other phyla are only sparsely sampled
- Same trend in Eukaryotes
Acidobacteria
Bacteroides
Fibrobacteres
Gemmimonas
Verrucomicrobia
Planctomycetes
Chloroflexi
Based on Hugenholtz, 2002
79As of 2002
- At least 40 phyla of bacteria
- Genome sequences are mostly from three phyla
- Some other phyla are only sparsely sampled
- Same trend in Viruses
Acidobacteria
Bacteroides
Fibrobacteres
Gemmimonas
Verrucomicrobia
Planctomycetes
Chloroflexi
Based on Hugenholtz, 2002
80Need experimental studies from across the tree too
Acidobacteria
Bacteroides
Fibrobacteres
Gemmimonas
Verrucomicrobia
Planctomycetes
0.1
Chloroflexi
Tree based on
Hugenholtz (2002)
with some
modifications.
81(No Transcript)
82MICROBES
83A Happy Tree of Life
84GEBA Pilot Project Components
- Project overview (Phil Hugenholtz, Nikos
Kyrpides, Jonathan Eisen) - Project management (David Bruce, Lynne Goodwin et
al) - Culture collection and DNA prep (DSMZ,
Hans-Peter Klenk) - Libraries and DNA (Eileen Dalin et al)
- Sequencing and closure (Susan Lucas, Alla Lapidus
et al.) - Annotation and data release (Nikos Kyrpides)
- Analysis (Dongying Wu, Kostas Mavrommatis, Martin
Wu, Jenna Morgan, Victor Kunin, Marcel Huntemann,
Neil Rawlings, Ian Paulsen, Patrick Chain, Patrik
DHaeseleer, Sean Hooper, Iain Anderson) - Adopt a microbe education project (Cheryl
Kerfeld) - Outreach (David Gilbert)
- (DOE, Eddy Rubin, Jim Bristow)