The challenge of constructing large phylogenetic trees

Michael Sanderson, Amy C. Driskell

Research output: Contribution to journalArticle

74 Citations (Scopus)

Abstract

The amount of sequence data available to reconstruct the evolutionary history of genes and species has increased 20-fold in the past decade. Consequently the size of phylogenetic analyses has grown as well, and phylogenetic methods, algorithms and their implementations have struggled to keep pace. Computational and other challenges raised by this burgeoning database emerge at several stages of analysis, from the optimal assembly of large data matrices from sequence databases, to the efficient construction of trees from these large matrices and the piece-wise assembly of 'supertrees' from those trees in turn. A final challenge is posed by the difficulty of visualizing and making inferences from trees that might soon routinely contain thousands of species.

Original languageEnglish (US)
Pages (from-to)374-379
Number of pages6
JournalTrends in Plant Science
Volume8
Issue number8
DOIs
StatePublished - Aug 1 2003
Externally publishedYes

Fingerprint

phylogeny
Databases
History
history
Genes
genes
methodology

ASJC Scopus subject areas

  • Genetics

Cite this

The challenge of constructing large phylogenetic trees. / Sanderson, Michael; Driskell, Amy C.

In: Trends in Plant Science, Vol. 8, No. 8, 01.08.2003, p. 374-379.

Research output: Contribution to journalArticle

Sanderson, Michael ; Driskell, Amy C. / The challenge of constructing large phylogenetic trees. In: Trends in Plant Science. 2003 ; Vol. 8, No. 8. pp. 374-379.
@article{84c149796eef4054b6e97765a40b30a1,
title = "The challenge of constructing large phylogenetic trees",
abstract = "The amount of sequence data available to reconstruct the evolutionary history of genes and species has increased 20-fold in the past decade. Consequently the size of phylogenetic analyses has grown as well, and phylogenetic methods, algorithms and their implementations have struggled to keep pace. Computational and other challenges raised by this burgeoning database emerge at several stages of analysis, from the optimal assembly of large data matrices from sequence databases, to the efficient construction of trees from these large matrices and the piece-wise assembly of 'supertrees' from those trees in turn. A final challenge is posed by the difficulty of visualizing and making inferences from trees that might soon routinely contain thousands of species.",
author = "Michael Sanderson and Driskell, {Amy C.}",
year = "2003",
month = "8",
day = "1",
doi = "10.1016/S1360-1385(03)00165-1",
language = "English (US)",
volume = "8",
pages = "374--379",
journal = "Trends in Plant Science",
issn = "1360-1385",
publisher = "Elsevier Limited",
number = "8",

}

TY - JOUR

T1 - The challenge of constructing large phylogenetic trees

AU - Sanderson, Michael

AU - Driskell, Amy C.

PY - 2003/8/1

Y1 - 2003/8/1

N2 - The amount of sequence data available to reconstruct the evolutionary history of genes and species has increased 20-fold in the past decade. Consequently the size of phylogenetic analyses has grown as well, and phylogenetic methods, algorithms and their implementations have struggled to keep pace. Computational and other challenges raised by this burgeoning database emerge at several stages of analysis, from the optimal assembly of large data matrices from sequence databases, to the efficient construction of trees from these large matrices and the piece-wise assembly of 'supertrees' from those trees in turn. A final challenge is posed by the difficulty of visualizing and making inferences from trees that might soon routinely contain thousands of species.

AB - The amount of sequence data available to reconstruct the evolutionary history of genes and species has increased 20-fold in the past decade. Consequently the size of phylogenetic analyses has grown as well, and phylogenetic methods, algorithms and their implementations have struggled to keep pace. Computational and other challenges raised by this burgeoning database emerge at several stages of analysis, from the optimal assembly of large data matrices from sequence databases, to the efficient construction of trees from these large matrices and the piece-wise assembly of 'supertrees' from those trees in turn. A final challenge is posed by the difficulty of visualizing and making inferences from trees that might soon routinely contain thousands of species.

UR - http://www.scopus.com/inward/record.url?scp=0042695888&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0042695888&partnerID=8YFLogxK

U2 - 10.1016/S1360-1385(03)00165-1

DO - 10.1016/S1360-1385(03)00165-1

M3 - Article

C2 - 12927970

AN - SCOPUS:0042695888

VL - 8

SP - 374

EP - 379

JO - Trends in Plant Science

JF - Trends in Plant Science

SN - 1360-1385

IS - 8

ER -