1. The document discusses the Open Tree of Life project, which aims to synthesize a complete draft tree of life from existing phylogenetic trees.
2. The project will release the tree with annotations, areas of uncertainty, links to source data, and tools to download subtrees. It will also automatically update the tree with new sequence data.
3. The goal is to provide a complete phylogenetic framework and link it to biodiversity and systematics content through an open, community-driven effort.
1 of 29
More Related Content
If this is the future, where is my tree of life?
1. IF THIS IS THE FUTURE,
WHERE IS MY TREE OF LIFE?
Karen Cranston
National Evolutionary Synthesis Center (NESCent)
@kcranstn
http://www.slideshare.net/kcranstn
2. Tree of life
~2million named
species
Millions
more
unnamed / undiscovered
7. Archiving sequence data is a
community norm
~4% of all published
phylogenetic trees
Stoltzfus et al 2012
8. Publishing a tree
= picture in a PDF
EVOLUTION
Fig. 1. Combined molecular phylogenetic tree for Diptera. Partitioned ML analysis of combined taxon sets of tier 1 and tier 2 FLYTREE data samples (lnL =
Weigmann et al. PNAS, 2011
344155.6169) calculated in RAxML. Circles indicate bootstrap support >80% (black/bp = 95100%, gray/bp = 8894%, white/bp = 8088%). Nodes with im-
proved bootstrap values resulting from postanalysis pruning of unstable taxa are marked by stars (black/bp = 95100%, gray/bp = 8894%, white/bp = 80
88%). Colored squares on terminal branches indicate the presence, in at least one species of a family, of ecological traits as shown to lower left. The number
of origins of each trait was estimated with reference to the phylogeny, the distribution of each trait among genera within a family, and the known biology of
the organisms.
thermore, a paraphyletic relationship of phorids and syrphids To test this hypothesis, we used a relatively recent phylogenomic
would support the hypothesis that their shared special mode of marker: small, noncoding, regulatory micro-RNAs (miRNAs).
11. Fig. 1. Combined molecular phylogenetic tree for Diptera. Partitioned ML analysis of combined taxon sets of tier 1 and tier 2 FLYTREE data samples (lnL =
344155.6169) calculated in RAxML. Circles indicate bootstrap support >80% (black/bp = 95100%, gray/bp = 8894%, white/bp = 8088%). Nodes with im-
proved bootstrap values resulting from postanalysis pruning of unstable taxa are marked by stars (black/bp = 95100%, gray/bp = 8894%, white/bp = 80
88%). Colored squares on terminal branches indicate the presence, in at least one species of a family, of ecological traits as shown to lower left. The number
of origins of each trait was estimated with reference to the phylogeny, the distribution of each trait among genera within a family, and the known biology of
the organisms.
thermore, a paraphyletic relationship of phorids and syrphids To test this hypothesis, we used a relatively recent phylogenomic
would support the hypothesis that their shared special mode of marker: small, noncoding, regulatory micro-RNAs (miRNAs).
extraembryonic development (dorsal amnion closure) (26) miRNAs exhibit a striking phylogenetic pattern of conservation
evolved in the stem lineage of Cyclorrhapha and preceded the across the metazoan tree of life, suggesting the accumulation and
origin of the schizophoran amnioserosa. maintenance of miRNA families throughout organismal evolution
Wiegmann et al. PNAS Early Edition | 3 of 6
12. assembly
alignment
inference
expertise Fig. 1. Combined molecular phylogenetic tree for Diptera. Partitioned ML analysis of combined taxon sets of tier 1 and tier 2 FLYTREE data samples (lnL =
344155.6169) calculated in RAxML. Circles indicate bootstrap support >80% (black/bp = 95100%, gray/bp = 8894%, white/bp = 8088%). Nodes with im-
proved bootstrap values resulting from postanalysis pruning of unstable taxa are marked by stars (black/bp = 95100%, gray/bp = 8894%, white/bp = 80
88%). Colored squares on terminal branches indicate the presence, in at least one species of a family, of ecological traits as shown to lower left. The number
of origins of each trait was estimated with reference to the phylogeny, the distribution of each trait among genera within a family, and the known biology of
the organisms.
time thermore, a paraphyletic relationship of phorids and syrphids
would support the hypothesis that their shared special mode of
extraembryonic development (dorsal amnion closure) (26)
evolved in the stem lineage of Cyclorrhapha and preceded the
origin of the schizophoran amnioserosa.
To test this hypothesis, we used a relatively recent phylogenomic
marker: small, noncoding, regulatory micro-RNAs (miRNAs).
miRNAs exhibit a striking phylogenetic pattern of conservation
across the metazoan tree of life, suggesting the accumulation and
maintenance of miRNA families throughout organismal evolution
$$$ Wiegmann et al. PNAS Early Edition | 3 of 6
14. NSF IDEAS LAB
i. Pre-proposal / application iv. Pitch high risk proposal
ideas at end
ii. 5 day highly facilitated
workshop v. NSF invited full proposals
iii. Self-assembly into groups
15. Community assembly of the
tree of life (Open Tree of Life)
Next generation Phenomics
(PI OLeary)
Arbor: Comparative Analysis
Work鍖ows (PI Harmon)
16. Karen Cranston, lead PI (Duke)
Gordon Burleigh (Florida)
Keith Crandall (BYU)
Karl Gude (MSU)
David Hibbett (Clark)
Mark Holder (Kansas)
Laura Katz (Smith)
opentreeo鍖ife.org Rick Ree (FMNH)
Stephen Smith (Michigan)
Doug Soltis (Florida)
Tiffani Williams (TAMU)
AVAToL: Assembling, Visualizing and Analysis of
the Tree of Life
17. 1. Synthesize a complete draft tree of life from existing
phylogenetic trees
18. 1. Synthesize a complete draft tree of life from existing
phylogenetic trees
2. Release with:
a. ability to add annotations and upload new data sets
b. areas of uncertainty / con鍖ict
c. links to source data and analysis methods
d. utilities to download whole tree and subtrees
20. Graph database holding
thousands of input trees with 鍖lter / weight input trees
millions of nodes
build synthetic trees
compare to alternate trees
input new data sets
22. Dipsicales graph Synthesized tree (favouring
phylogenetic branches); contains
all 578 taxa
23. AUTOMATIC UPDATING
update trees
with new
sequence data
detect and synthesize newly
published trees
24. ?
Open Data
increasing
availability of digital data associated with
phylogeny publications
synthetic
tree open to community annotation and
new data submission
whole tree / subtrees available for download
25. ?
Open Science
project wiki: http://opentree.wikispaces.com/
open source software: https://github.com/OpenTreeOfLife
public mailing list, meeting notes, management tools
27. provide complete phylogenetic
framework
link to biodiversity and systematics
content
API for downloading subtrees to analysis tools
source / storage of underlying data
28. opentreeo鍖ife.org
Weve only just started (June 1 2012)
Open to input, feedback and participation:
join the mailing list & wiki
add publications to the Mendeley group
vote / comment on plans on the development boards
participate in virtual data curation sprint in 2013
29. Karen Cranston, lead PI (Duke)
Gordon Burleigh (Florida)
Keith Crandall (BYU)
Karl Gude (MSU)
David Hibbett (Clark)
Mark Holder (Kansas)
Laura Katz (Smith)
opentreeo鍖ife.org Rick Ree (FMNH)
Stephen Smith (Michigan)
Doug Soltis (Florida)
Tiffani Williams (TAMU)
AVAToL: Assembling, Visualizing and Analysis of the Tree of Life